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TREATMENT OF PAIN THROUGH EXPRESSION OF OPIOID RECEPTORS 

[01] This application claims priority of United States Provisional Application No. 
60/448,663, filed on February 19, 2003, which is herein incorporated by reference in its entirety. 

I. BACKGROUND 

[02] Tissue injury and nerve damage caused by trauma, infection, arthritis or iatrogenic 
procedures can produce inflammation, spontaneous pain and hyperalgesia (Levine JD and Taiwo 
YO, Anesth Prog 1990;37:133-35; Goelet P, et d., Mrtwre. 1986;322(6078):419-22). Furthermore, 
patients affected by temporomandibular joint and/or masticatory muscle (orofacial) pain often suffer 
because dental, surgical and/or pharmacologic therapies do not consistently give adequate symptom 
relief. In fact, according to Public Health Services (PHS) estimates, there are more than 50 million 
Americans who experience chronic pain with 45% seeking medical care at some point in their lives. 
It is also estimated that 40% of pain patients never receive adequate relief. Lipton et al. (Lipton JA, 
et al., JADA 1993;124:1 15-21.) reported that 22% of the population in the United States experienced 
at least one episode of orofacial pain in the last six months. It is estimated that approximately $80 
billion is spent annually to treat pain and that 40% of that is to treat craniofacial pain (Bonica JJ. 
Preface et al., eds. Advances in Pain Research Therapy, Vol 3. New York: Raven Press; 1973:v- 
vii). To date, pharmacological approaches still dominate the clinical pain arena, with only modest 
efforts being directed towards the development of new innovative treatment regimes for the 
management of pain. Disclosed are vectors and methods for reducing pains, such as myalgic and 
arthralgic pains, and such as those in the orofacial region. 

n. SUMMARY 

[03] As embodied and broadly described herein, disclosed herein, in one aspect, are 
vector constructs that comprise sequence encoding a polypeptide for treating pain. Also disclosed 
are methods for treating pain by expressing the ji-opioid receptor protein in nerve cells. 

[04] It is to be understood that both the foregoing general description and the following 
detailed description are exemplary and explanatory only and are not restrictive. 

HI. BRIEF DESCRIPTION OF THE DRAWINGS 

[05] Figure 1 shows the human pt-opioid transient transfection in N2a cells. Figure 1 A 
shows that irnmunoc^ochemistry reveals expression of HUMOR driven by the cytomegalovirus 
CMV promoter in neuronal cells using monoclonal antibody. Figure IB shows mock 
transfected cells. 



WO 2004/073646 



PCTYUS2004/004914 



[05] Figure 2 shows FIV(lacZ) local administration to the temporomandibular joint 
(TMJ) area of the face. Figure 2A shows FIV was injected at the right TMJ with 10 8 infectious 
particles per mL, which receives sensoiy innervation from the mandibular division of the 
trigeminal nerve. Saggital(B) and horizontal (Qsections of the right gasserion (trigeminal) 
ganglion display X-gal positive neuronal cellbodies that were transduced following FlVflacZ) 
injection in the TMJ. 

[06] Figure 3 shows a representation of a lentiviral system containing the HUMOR 
gene. The 3-vector FIV(HUMOR) system. The FIV(HUMOR) lentiviral system is comprised 
of 3 vectors: Packaging vector providing the packaging instructions in trans,- VSV-G envelop 
vector (VSV-G sequence in SEQ ID NO:54) providing the envelop instructions in trans, - and 
FIV(HUMOR) vector containing the therapeutic gene. 

[07] Figure 4 shows a schematic of an exemplary LIV vector carrying a HUMOR 
cassette. 

[08] Figure 5 shows FIV(lacZ) injection (a total of 5X10 6 infectious particles) to the 
right TMJ resulted in widespread infection of hard as well as soft tissues of the joint. (A) 
Sagittal TMJ sections analyzed by p-galactosidase immunohistochemistry and counter-stained 
by nuclear fast red revealed expression of the reporter gene lacZ in the hypertrophic zone of the 
condyle, primarily comprised of cartilaginous cells, (B) as well as in the meniscus, endothelial 
cells and perivascular osteocytes. Panel (C) depicts TMJ sections from a saline injected animal. 
c=condyle; d=disk; m=muscle; v=vessel. 

[09] Figure 6 shows the development of the control FIV(A'Iac) vector with inactive 0- 
galactosidase gene. (A) The reporter gene lacZ was inactivated after deletion of a critical placZ 
DNA fragment containing the P-galactosidase gene transcription initiation site by restriction 
enzyme-mediated excision and re-ligation of the backbone vector. (B) The structure of mutated 
FIV(A'lac) and wild type FlV(lacZ) viral vectors were confirmed by PCR following transient 
transfection into the murine cell line NIH 3T3. The presence of viral DNA in cells was detected 
by a 444 bp DNA band utilizing the "FIV" primers (as depicted in panel A). The complete 
structure of lacZ gene was confirmed by a 1 .7 kb DNA band utilizing die lacZ primers (depicted 
as UP, LP in panel A). In the case of the mutated FTV(A'lac), there was lack of the 1 .7 kb DNA 
band as the annealing site for the lower primer LP was excised. (C) Deletion of the lacZ 
transcription initiation sequence in the FIV(A'lac) resulted in inactivation of the p-galactosidase 
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reporter gene as demonstrated by the lack of X-gal staining compared to (D) the FIV(lacZ) 
vector. 

[10] Figure 7 shows FIV(lacZ) and FIV(A'lac) injections (5X1 0 6 infectious particles) 
in the right TMJ of mice resulted in successful infection of primary sensory neurons located in 
5 the ispilateral trigeminal ganglion. The animals' left side TMJ was not treated (A) The presence 
of backbone FTV DNA in the right trigeminal ganglia ipsilateral to FIV injections was detected 
by a 444 bp DNA band in lanes 1 and 3, utilizing the "FTV" primers (as depicted in panel A), 
suggesting successful transduction of the trigeminal sensory neurons by FIV vectors. Lanes 2 
and 4 do not display any viral DNA as they represent left side ganglia. (B) The inactive form of 
10 P-galactosidase gene in transduced neurons was detected by the absence of the 1 .7 kb DNA 
band (lane i) compared with the wild type gene (lane 3). Lanes 2 and 4 do not display any viral 
DNA as they represent left side ganglia. (C) The successful extraction of genomic DNA from 
left and right ganglia was confirmed by PCR utilizing primers designed for the murine 
housekeeping gene G3PDH (385 bp). 

15 [11] Figure 8 shows injection of FIV(lacZ) in the right TMJ (5X10 6 infectious 

particles) resulted in successful transduction of primary sensory neurons with the reporter gene 
p-galactosidase in trigeminal ganglia ispilateral to the treated joint. (A) p-galactosidase 
expression was detected by X-gal histochemistry in sagittal sections of right-side ganglion (4X), 
(B) primarily at its posterior and posterolateral region (20X). (C) Injection of FIV(A'lac) did not 

20 result in P-galactosidase expression. (D) The X-gal staining was confirmed with 

immunocytochemistry employing antibodies raised against bacterial P-galactosidase following 
FlV(IacZ) injection compared to (E) FrV(A'lac) treatment. 

[12] Figure 9 shows The neuronal cell line N2<x was infected with HIV(HUMOR) [a 
Lenti virus]. Total RNA was extracted from infected and control cells. The levels of HUMOR 
25 and G3PDH transcript were assessed by RT-PCR. Minimal amounts of HUMOR were detected 
in naive cells (CI, C2), as well as cells infected with the HSVQacZ) virus (LI, L2). In contrast, 
HUMOR was readily detected in cells infected with HIV(HUMOR). As a control, the 
housekeeping gene G3PDH transcript was detected in all samples analyzed. 
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IV. DETAILED DESCRIPTION 

[13] Before the present compounds, compositions, articles, devices, and/or methods 
are disclosed and described, it is to be understood that the disclosure is not limited to specific 
synthetic methods or specific recombinant biotechnology methods unless otherwise specified, or 
to particular reagents unless otherwise specified, as such may, of course, vary. It is also to be 
understood that the terminology used herein is for the purpose of describing particular 
embodiments only and is not intended to be limiting. 

[14] Disclosed are the components to be used to prepare the disclosed compositions 
as well as the compositions themselves to be used within the methods disclosed herein. These 
and other materials are disclosed herein, and it is understood that when combinations, subsets, 
interactions, groups, etc. of these materials are disclosed that while specific reference of each 
various individual and collective combinations and permutation of these compounds may not be 
explicitly disclosed, each is specifically contemplated and described herein. For example, if a 
particular jx-opioid receptor vector is disclosed and discussed and a number of modifications 
that can be made to a number of molecules including the ^-opioid receptor vector are discussed, 
specifically contemplated is each and every combination and permutation of the jx-opioid 
receptor vector and the modifications that are possible unless specifically indicated to the 
contrary. Thus, if a class of molecules A, B, and C are disclosed as well as a class of molecules 
D, E, and F and an example of a combination molecule, A-D is disclosed, then even if each is 
not individually recited each is individually and collectively contemplated meaning 
combinations, A-E, A-F, B-D, B-E, B-F, C-D, C-E, and C-F are considered disclosed. 
Likewise, any subset or combination of these is also disclosed. Thus, for example, the sub- 
group of A-E, B-F, and C-E would be considered disclosed. This concept applies to all aspects 
of this application including, but not limited to, steps in methods of making and using the 
disclosed compositions. Thus, if there are a variety of additional steps that can be performed it 
is understood that each of these additional steps can be performed with any specific embodiment 
or combination of embodiments of the disclosed methods. 
A. Definitions 

[15] As used in the specification and the appended claims, the singular forms "a," 
"an" and "the" include plural referents unless the context clearly dictates otherwise. Thus, for 
example, reference to "a pharmaceutical carrier" includes mixtures of two or more such carriers, 
and the like. 
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[16] Ranges can be expressed herein as from "about" one particular value, and/or to 
"about" another particular value. When such a range is expressed, another embodiment includes 
from the one particular value and/or to the other particular value. Similarly, when values are 
expressed as approximations, by use of the antecedent "about," it will be understood that the 
5 particular value forms another embodiment It will be further understood that the endpoints of 
each of the ranges are significant both in relation to the other endpoint, and independently of the 
other endpoint. It is also understood that there are a number of values disclosed herein, and that 
each value is also herein disclosed as "about" that particular value in addition to the value itself. 
For example, if the value "10" is disclosed, then "about 10" is also disclosed. It is also 

10 understood that when a value is disclosed that "less than or equal to" the value, "greater than or 
equal to the value" and possible ranges between values are also disclosed, as appropriately 
understood by the skilled artisan. For example, if the value "10" is disclosed the "less than or 
equal to 10"as well as "greater than or equal to 10" as well as "less than" and "greater than" 10 
are also disclosed. It is also understood that the throughout the application, data is provided in a 

15 number of different formats, and that this data, represents endpoints and starting points, and 

ranges for any combination of the data points. For example, if a particular data point "10" and a 
particular data point 1 5 are disclosed, it is understood that greater than, greater than or equal to, 
less than, less than or equal to, and equal to 10 and 1 5 are considered disclosed as well as 
between 10 and 15: 

20 [17] In this specification and in the claims which follow, reference will be made to a 

number of terms which shall be defined to have the following meanings: 

[18] "Optional" or "optionally" means that the subsequently described event or 
circumstance may or may not occur, and that the description includes instances where said event 
or circumstance occurs and instances where it does not 

25 [19] "Primers" are a subset of probes which are capable of supporting some type of 

enzymatic manipulation and which can hybridize with a target nucleic acid such that the 
enzymatic manipulation can occur. A primer can be made from any combination of nucleotides 
or nucleotide derivatives or analogs available in the art which do not interfere with the 
enzymatic manipulation. 

30 [20] "Probes" are molecules capable of interacting with a target nucleic acid, typically 

in a sequence specific manner, for example through hybridization. The hybridization of nucleic 
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acids is well understood in the art and discussed herein. Typically a probe can be made from 
any combination of nucleotides or nucleotide derivatives or analogs available in the art. 

[21] Throughout this application, various publications are referenced. The disclosures 
of these publications in their entireties are hereby incorporated by reference into this application 
in order to more fully describe the state of the art to which this invention pertains. The 
references disclosed are also individually and specifically incorporated by reference herein at 
least for the material contained in them that is discussed in the sentence in which the reference 
is relied upon. 

B. Compositions and methods 

[22] The ^-opioid receptor (HUMOR) is a key component of the intrinsic anti- 
nociceptive pathway in mammals: descending bulbospinal serotonergic and noradrenergic 
neuronal projections exert anti-nociceptive effects via release of endogenous opioids, which in 
turn activate p-opioid receptors present on the presynaptic membrane of the primary sensory 
neurons. Pain stimulus travels through the nerve to the brain through activation of nociceptors. 
The activation of fi-opioid receptors through binding of opioids interrupts the transmission of ' 
the pain signal. Mammals release endogenous opioids when under pain assault and billions are 
spent each year in the pharmaceutical industry to treat pain through the administration of opioids 
and opioid like molecules that target the ji-opioid receptors. Disclosed herein are compositions 
and methods for the treatment of painj which do not require the administration of p,-opioid 
receptor targeted molecules or utilize lower effective amounts of opioid receptor targeted 
molecules. The disclosed methods involve the over-expression of ji-opioid receptors, which can 
make the nerve cell more receptive to endogenous opioid molecules or to opioids or opioid like 
molecules administered as a pharmaceutical. Over expression of the |>opioid receptors can 
occur through simulation of endogenous opioid receptor genes or through transgenic therapy 
that delivers a construct encoding the opioid receptor. 

[23] Receptor up-regulation is designed to result in circumventing the observed 
desensitization following prolonged opioid drug administration, which in part occurs as a 
decrease in receptor expression. Furthermore, the strategy can take advantage of the existing 
intrinsic anti-nociceptive mechanism by ensuring adequate ^.-opioid receptor presence at the site 
of the central processing of pain. This adequate receptor presence is consistent with heightened 
sensitivity of patients to drags administered exogenously, which is consistent with requiring 
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smaller doses of opioid analgesics, such as Ultram, Fentanil, and Darvon, which otherwise 
commonly result to pathologic addiction. 

[24] Disclosed are compositions and methods to target the expression of opioid 
receptors, such as the ^-opioid receptors, such as human opioid receptors, to sensory neurons 
5 innervating regions that can experience pain, such as orofacial regions that experience 

nociception. Disclosed are compositions and methods for targeting opioid receptors, such as the 
ja-opioid receptors, expression in sensory orofacial neurons. Also disclosed are compositions 
and methods for targeting opioid receptors, such as the ja-opioid receptors, expression in any 
sensory neuron. For example, the compositions and methods can be used in any sensory neuron, 

10 wherein the sensory neuron processes pain or other "input" signals from peripheral tissues (e.g., 
joints, amputated limbs, extracted or endodontically treated teeth), as well as vital organs. For , 
example, disclosed are vectors, such as feline immunodeficiency lenti viral vectors (FIV), rAAV 
vectors, HSV Amplicon, and liposomes for delivery of the opioid receptor DNA. 
Administration of the vectors peripherally to infect those sensory neurons, such as those 

15 innervating the orofacial region, can be performed For example, the vectors can be delivered at 
the point of pain, for example, an extremity, by for example, injection into the extremity. 
Disclosed are vectors, such as FW, rAAV, HSV Amplicon, and liposomes, capable of stably 
transducing terminally differentiated cells, including neurons. 

a) Nervous system 

20 [25] The nervous system can be divided into two parts: central and peripheral. The 

central nervous system consists of the encephalon or brain and the medulla spinalis or spinal 
cord. These two parts, the brain and the spinal cord are continuous with on another at the level 
of the upper border of the atlas vertebra. The peripheral nervous system consists of a series of 
nerves, which connect the central nervous system to all of the tissues in the body. Nerves also 

25 are often grouped as cerebrospinal and sympathetic. However, since the two groups are 

intimately connected and closely intermingled these distinctions are not absolute. Nerve cells 
can also be classified as efferent or afferent nerves. Efferent nerve cells are nerve cells that 
transmit signals from the brain to the periphery and afferent nerve cells are nerve cells that 
transmit signals from the periphery to the brain. 

30 [26] Neurons act as pain pathways and these pathways include peripheral, spinal, and 

supraspinal elements. The peripheral part of the system includes the primary afferent sensory 
neurons. These neurons are called nociceptors, and can be found throughout the body, such as 
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in the skin, muscle, connective tissue, the cardiac system, and abdominal and thoracic viscera. 
Nociceptors are uncapsulated nerve endings that detect thermal, mechanical, or chemical 
stimuli, and are thus, not small molecule receptors. Nociceptors can be thinly myelinated or 
unmyelinated nerve fibers. The thinly myelinated variety are termed A-delta fibers and the 
5 unmyelinated variety are termed C-polymodal fibers. The primary functional difference 

between A and C delta fibers is that A-delta fibers are rapidly conducting and C delta fibers are 
slowly conducting. This means that A delta fibers transmit sensations perceived as fast, sharp, 
well-localized pricking pain, and C-polymodal fibers transmit feeling via thermal, mechanical, 
and chemical stimuli transmitting sensations perceived as dull, aching, burning, poorly localized 
10 pain. 

[27] Most A-delta and the C-polymodal afferent fibers enter the dorsal horn of the 
. spinal cord by way of the dorsal nerve roots and their ganglia. Wide dynamic range neurons • 

receive nociceptive and non-nociceptive input from the skin, muscle, and viscera. This 
. convergence can account for visceral referred pain. Impulses are then transmitted to the brain 
15 . by the spinal thalamic tract (STT). Near the thalamus, the STT bifurcates into the 
. neospinothalamic tract and the paleospinothalamic tract, projecting to the thalamus, 
hypothalamus, periaqueductal gray matter (PAG) in the brain stem. The thalamus processes 
sensory input is projected to the cerebral cortex, basal ganglia, and limbic system. Descending 
pathways conduct transmission from the brain to the spinal cord control and modify afferent 
20 sensory input. 

[28] Nociception can be thought of as the detection of tissue damage by nociceptors. 
Modulation of nociception occurs peripherally, spinally, and supraspinally. Tissue damage is 
associated with the release of chemical mediators, such as serotonin, histamine, bradykinin, 
cytokines, prostaglandins, and leukotrienes, which produce inflammation, and occurs in the 
25 peripheral system. The pain transmission is modulated by these events and this lowers 

excitability threshold of the nociceptor threshold so that stimuli normally non-painful stimuli 
become painful. This is called nociceptor sensitization. Two other substances that sensitize 
nociceptors are substance P and glutamate, which can be released from nerve terminals. 

[29] The signals from the nociceptors are processed in the dorsal horn of the spine. 
30 Repetitive, convergent input from A-delta and C polymodal fibers at the dorsal horn can result 
in a state where less stimulation is required for the generation of a pain response. This is known 
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as the wind-up phenomenon, and is thought to be initiated by the release of substance P and the 
excitatory amino acids glutamate and aspartate, 

[30] The brain also signals the spinal cord to modulate the pain response. The PAG 
region of the brainstem contains high concentrations of opioid receptors, and sends projections 
5 to the rostral medulla and eventually to the dorsal root inhibiting ascending pain impulses. 
Thus, the activation of the opioid receptors interrupts the transmission of the pain signal. 
Descending pathways can also stimulate spinal nociceptive transmission as well. 

b) Pain 

[31] Pain is typically classified into two categories: nociceptive pain (somatic pain) 
10 and neuropathic pain. Nociceptive pain is pain that is sensed after some type of trauma. The 
nociceptive pain is sensed by the "nociceptor" sensory fibers which are connected to the nervous 
system. After an injury to a muscle, soft tissue (ligaments, tendons), bones, joints, or skin (or 
other organs), these sensory fibers are stimulated which causes a transmission of a signal 
through an afferent neuron to the brain. Nociceptive pain is often characterized as a deep 
15 aching, throbbing, gnawing, or sore sensation. Common examples of nociceptive pain include: 
pain after trauma (e.g. a car accident or a fall), postoperative pain, and arthritis pain. 
Nociceptive pain is usually localized and gets better with healing. 

[32] Neuropathic pain is pain caused by damage to nerve tissue. Neuropathic pain is 
often characterized as burning, severe shooting pains, and/or persistent numbness or tingling. 
20 Common examples of neuropathic pain related to back pain include sciatica, pain that travels 
from the spine down the aim, and pain that persists after back surgery. 

[33] It is thought that in some cases prolonged nociceptive pain may progress to 
neuropathic pain, and a patient may have both nociceptive and neuropathic pain at the same 
time. Pain is also often classified as acute pain or chronic pain. Acute pain is characterized as 

25 pain where the amount of pain directly correlates with the level and duration of tissue damage. 
Acute pain therefore, provides a protective reflex, such as the reflex to move your hand 
immediately if you touch a sharp object This type of pain is a symptom of injured or diseased 
tissue, so that when the underlying problem is cured the pain goes away. Acute pain is a form 
of nociceptive pain. Chronic pain on the other hand, does not correlate with the severity of the 

30 insult, and therefore, typically will not serve a protective function. Prolonged damage to tissues, 
i.e. knee pain or tooth ache, will eventually result in plastic (non reversible) changes in the 
neurons that process pain from that area, which now facilitate either allodynia and/or 
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hyperalgesia. Chronic pain is born following these plastic neuronal changes, whereby the 
neurons are now "sick" and pain will occur even in the absence of peripheral stimulus (e.g., 
amputated limbs, extracted teeth). In fact, its basis is neuropathic now, and neurons 
continuously send pain messages to the brain even though there is no continuing tissue damage. 
5 Neuropathic pain is a form of chronic pain. 

(1) Anatomy of orofacial pain 

[34] The mandibular division of the trigeminal nerve provides sensory innervation to 
the TMJ and masticatory muscles. The cell bodies of these primary sensory neurons are located 
in the inferior portion of the trigeminal ganglion extending their unmyelinated (C-fibers) or 
10 thinly myelinated (A5-fibers) peripheral projections to structures of the face and jaws. More 
specifically, nociceptive innervation to the temporomandibular joint (TMJ) is primarily 
provided by the auriculotemporal nerve of the mandibular division of the trigeminal nerve 
(Sessle BJ, Hu JW (1991). Can J Physiol PJiarmacol 69: 617-626). A5 and C nerve fibers, 
whose cell bodies are located in the posterolateral part of the trigeminal ganglion (Yoshino K, et 

15 al. (1 998). Arc Oral Biol; 43 : 679-686), project distally and terminate as non-encapsulated free 
nerve endings dispersed throughout the posterolateral part of the TMJ capsule (Bemick S 
(1962). OralSurg 15:488^92; ThilanderB (1964).^c/a OdontScan 22:151-156; Frommer J, 
Monroe CW (1966). J Dent Res 45:1762-1766; Klineberg I (1971). Ann Royal Coll Surg Engl 
49:268-288), the posterior band of the meniscus and the posterior attachment (Dressen D, et al. 

20 (1990). Acta Anat 139:154-160; Kido MA, et al. (1991). Arch Oral Biol 36:397-400, Kido MA, 
et al. (1993). J Dent Res 72:592-598; Wink CS, et al. (1992). J Oral Maxillofac Surg 
50:334-337). Inflammation, injury or other agents may cause excitation of the free and 
unspecialized nerve endings of the unmyelinated C-fibers, which are predominately involved in 
the transmission of nociception from the TMJ, muscles of mastication as well as the pulp of 

25 teeth. The central projections enter the brain stem via the ventrolateral pons, descend caudally 
as the dorsolateral trigeminal tract and synapse with second order sensory neurons at the 
substantia gelatinosa of the subnucleus caudalis of the descending trigeminal nucleus (medullary 
dorsal horn). Second order sensory neurons extend projections to the nucleus proprius, 
followed by subsequent projections to the intermedial gray, and then to the reticular formation 

30 of the brain stem, and through the intralaminal nuclei of the thalamus project wide spread 

connections into the cortex. The ascending sensory neural architecture is also susceptible to an 
intrinsic opioid-releasing anti-nociceptive descending system, the inhibitory effects of which 
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are mediated by opioid receptors expressed in the presynaptic membrane of the primary sensory 
neurons. Although pain is initially elicited at a peripheral site, it is further centrally modulated, 
i.e. in the brain, enhanced or attenuated, therefore making this aforementioned central 
processing of pain a major component in sensory orofacial nociception. 

5 [35] In the quest for developing new therapies for orofacial pain, gene therapy appears 

to be an emerging treatment method (Kuboki T, et al. (1999). Arc Oral Biol 44: 701-709; Pohl 
M, Braz J (2001). Eur J Pharmacol 429: 39-48; Baum BJ, et al. (2002). JADA 133: 35^4). For 
example, it has been previously suggested that delivery of antisense oligonucleotides developed 
against nociceptive genes to appropriate tissues may offer alternatives in designing novel 
10 treatments for pain management (Wu CL, et al. (2001). Anesthesiology 95: 216-240). 

[36] Disclosed herein, transfer of antinociceptive genes to sensory trigeminal neurons 
innervating Hie orofacial region can be achieved after injection of lentiviral vectors at the 
painful site, such as the TMJ, resulting in their uptake by free nerve endings and retrograde 
transport to the sensory cells' nuclei. Previous studies demonstrated axonal retrograde transport 

15 of horseradish peroxidase from the TMJ to the central nervous system (Romfh JH, et al. (1979). 
Exp Neurol 65: 99-106; Capra NF (1987). Somatosensory Res 4: 201-213) including the 
trigeminal ganglia (Yoshino K, et al. (1998). Arc Oral Biol; 43: 679-686). In evaluating the 
employment of lentiviral vectors as the basis for TMJ gene therapy, as an example, VSV-G 
pseudotyped feline immunodeficiency viral vectors (FIV) were used. These vectors are capable 

20 of stably transducing dividing, growth arrested as well as post-mitotic cells, as it is capable of 
transgene integration into the host's genome (Poeschla EM, et al. (1998). Nature Med 4: 354- 
357). VSV-G pseudotyping of viral vectors confers a broad range of host specificity, including 
human and murine cells, as infection is mediated by the interaction of the viral envelope protein 
and a phospholipid component of the cell membrane leading to membrane-fusion mediated 

25 entry (Burns JQetal. (1993). Proc Natl Acad Sci C/iS4 90: 8033-8037; Carneiro FA, et al. 

(2002). J Virol 76: 3756-3764). Therefore, FIV vectors can mediate sustained gene expression 
in non-dividing terminally differentiated trigeminal sensory neurons, a property unique to 
lentiviral vectors. 

c) Current pharmacologic agents in the management of pain 

30 [37] Non-steroidal anti-inflammatory drugs (NSAID's) are often utilized as the first 

line of agents for the management of pain. NSAID's primarily exert their pain-killing effects by 
inhibiting the production of prostanoids and attenuating peripheral inflammatory conditions that 
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may be responsible for pain elicitation. Alternatively, corticosteroids may be utilized with 
peripheral routes of action. In contrast, exogenously administered opioid drugs (morphine) 
mimic the effects of the endogenous opioids by crossing the blood brain barrier (BBB). 
Similarly, tricyclic antidepressants that cross the BBB have been also employed in cases of 
5 chronic pain by inhibiting the reuptake of serotonin and norepinephrine. However, each of these 
four classes of drugs is characterized by significant side effects that prohibit their long term use 
as well as often show unfavorable treatment outcomes. 

d) Opioid receptors and mechanism of action 

[38] Opioid analgesics have been used for pain management for hundreds of years. 
10 Opium itself consists of the dried latex from the unripe fruit of the opium poppy Papaver 
somniferum. Morphine is isolated from opium. Opioid receptors exist in the spinal and 
supraspinal regions of the nervous systems. Opioids can modulate neuronal transmission by 
binding to opioid receptors in the dorsal-root ganglia, the central terminals of primary afferent 
neurons (LaMotte C, et aL, Brain Res 1976;112:407-12; Fields HL, et aL, Nature 
15 1980;284:351-3) and peripheral sensory-nerve fibers and their terminals (Stein C, et al., Proc 
NatlAcadSciUSA 1990;87:5935-9; Hassan AHS, et al.,. Neurosdence 1993;55:185-95.. 
The dorsal-root ganglia and trigeminal ganglion (Xie GX, et aL, Life Sciences 1999; 64:2029- 
37; Li JL, et aL, Brain Res 1998; 794:347-52.) contain messenger RNA (mRNA) for opioid 
receptors (Schafer M, et aL, Eur J Pharmacol 1995;279: 165-9; Mansour A, et al., Brain Res 

20 1 994;643:245-65) and primary afferent nerves mediate the peripheral antinociceptive effects of 
morphine (Bartho L, et aL, Naunvn Schmiedebergs Arch Pharmacol 1990;342:666-70). The 
presence of endogenous opioids and their receptors are able to produce a potent anti- 
nociception. Opioids increase potassium currents and decrease calcium currents in the cell 
bodies of sensory neurons (Werz MA, Macdonald RL., Neurosci Lett 1983:42:173-ft; Schroeder 

25 JE, et al., Neuron 1991;6:13-20), both of which can lead to the inhibition of neuronal firing and 
transmitter release. A similar process occurring throughout the neuron, can explain why opioids 
attenuate both the excitability of the peripheral nociceptive terminals and the propagation of 
action potentials (Andreev N, et aL, Neuroscience 1994;58:793-8; Russell NJW, et al., Neurosci 
Lett 1987;76:107-12). At central nerve terminals, (Yaksh TL, et aL, Nature 1980;286:155-7) 

30 opioids inhibit the calcium-dependent release of excitatory, pro-inflammatory compounds (e.g., 
substance P) from peripheral sensory-nerve endings, (Yaksh TL., Brain Res 1988 458:319-24) 
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which contribute to the anti-inflammatory actions of opioids (Barber A, Gottschlich R. et aL, 
Med Res Rev 1992;12:525-62). 

[39] There are three known opioid receptors, ju, k, and 8-opioid receptors, \i- 
Receptors are further subdivided into at least two subclasses, pi and ^-receptors. The body 
5 produces opioid like molecules, called endogenous opioids, such as endorphins, enkephalins, 
and dynorphins. preceptors are known to mediate analgesia, respiratory depression, 
bradycardia, nausea/vomiting, and decreased gastrointestinal tone. 

[40] 5-receptors mediate supraspinal analgesia, and K-receptors mediate dysphoria and 
tachycardia. As pain impulses ascend through the spinal and supraspinal nervous system, 
10 activation of the opioid receptors inhibits these impulses and inhibits the transmission of pain 
from the dorsal horn. In addition, opioid analgesics prevent the presynaptic release of pain 
mediators such as Substance P into the spinal cord region. 

[41] All animals, such as mammals, such as human, contain opioid receptors. It is 
understood that there are homologs for the various opioid receptors across animal species. A 
15 number of different opioid receptor sequences are set forth in the SEQ IDS, including ^-opioid 
receptors. The human ji-opioid receptor is referred to herein as HUMOR. It is understood that 
if a particular statement or reference is made regarding HUMOR that this statement is equally 
applicable to homologous receptors, unless specifically indicated otherwise. 

[42] Opioid analgesics are typically grouped into three classes: naturally occurring 
20 (morphine, codeine); semi-synthetic morphine derivatives (hydromorphone, oxycodone, 
hydrocodone); and synthetic. Synthetic opioid analgesics include morphinan derivatives 
(levorphanol); methadone derivatives (methadone, propoxyphene); benzomorphan derivatives 
(pentazocine); and phenylpiperidine derivatives (meperidine, fentanyl, sufentanil, alfentanil, 
remifentanil). Tramadol is an opioid analgesic that also inhibits the reabsorption of 
25 norepinephrine and serotonin. 

[43] Another way to classify opioid analgesics is as agonists, partial agonists, mixed 
agonists/antagonists, and antagonists based on their interactions at the opioid receptors, ji, and 
k opioid-receptors are stimulated by agonists. Partial agonists have reduced jx-opioid receptor 
activity, and mixed agonists/antagonists only stimulate certain \x and K-opioid receptors. 
30 Antagonists bind p. and K-opioid receptors but do not stimulate the receptor activity. 
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[44] Some agonists are Morphine, Hydromorphone, Oxymorphine, Codeine, 
Oxycodone, Hydrocodone, Dihydrocodeine, Methadone, Meperidine, Fentanyl, Sufentanil, 
Alfentanil, and RemifentaniL An example of a partial agonist is Buprenorphine. Pentazocine, 
Nalbuphine, and Butorphanol are examples of mixed agonists/antagonists. Examples of 
antagonists are Naloxone and Nalmefene. It is understood that one way to classify opioid 
receptors is by which molecules act as antagonists and which act as agonists, for example. 
Thus, a receptor can be defined as "a receptor for which morphine is an agonist" 

[45] There are a number of adverse side effects that can occur when taking opioid 
analgesics, such as CNS effects, such as sedation, confusion, and respiratory depression. 
Gastrointestinal adverse effects include nausea, vomiting, and constipation. Involuntary 
muscular contractions (twitching) known as myoclonus, bradycardia, and hypotension, can also 
occur. Lastly, physical and psychological dependence can also occur upon use or prolonged use 
of opioid analgesics. Thus there is a significant need for the disclosed compositions and 
methods, which reduce or eliminate the need for opioid analgesics in many indications. 

e) (j,-opioid receptor therapy 

[46] The disclosed approach for the management of pain involves the targeted 
expression of opioid receptor(s) such as the ji-opioid receptor in the primary neurons 
innervating the areas, such as orofacial areas, experiencing pain, resulting in these same neurons 
becoming more susceptible to the intrinsic opioid antinociceptive mechanisms. Disclosed are 
compositions and methods for treating pain. The compositions comprise an opioid receptor that 
is expressed from a vector. Typically these compositions will be delivered to at the point of 
pain. It is thought that their expression, increases the efficiency of the body's own opioid like 
molecules and decreases pain. 

[47] Disclosed herein, the cDNA for a human ^-opioid receptor (HUMOR) is set 
forth in SEQ ID NO:2. The ^-opioid receptor (Raynor K, et al, J Pharmacol Exp Ther. 1995: 
272:423-8) has been placed into a vector herein and expressed in primary fibroblasts as well as 
cells of the N2a neuronal cell line (Figure 1). Transduction and stable expression of ^-opioid 
receptor in neurons can be accomplished by employing VS V-G pseudotyped immunodeficiency 
viral vectors (FTV). 

[48] The expression of the ji-opioid receptor in the neurons at the point of pain in 
certain embodiments requires transduction in a non-dividing cell such as a neuron. This can be 
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accomplished using a transduction mechanism, such as lipofection or encapsulation methods, or 
via viral vector systems that function with cell division, such a lentiviruses, such as the FIV 
virus, or adeno-associated viruses, rAAV vectors, HSV Amplicon, and liposomes. 

[49] It has been previously shown that this FIV system is capable, due to its lentiviral 
5 properties, of infecting terminally differentiated cells, including neurons. Disclosed are 

methods for administering vectors, such as the FIV(n-opioid receptor) vector, peripherally at the 
site of pain. The neurons innervating that specific site and mediating the nociceptive signals are 
infected and stably transduced. These vectors, including vectors expressing lacZ and the fi~ 
opioid receptor, can transduce nerve cells in vivo, in mice, through injection at the periphery. 

10 [50] Disclosed herein is the stable expression of a reporter gene, the lacZ gene, in 

neurons located in the appropriate region of the trigeminal ganglion following peripheral 
injection of FTV(lacZ) in the area of the TMJ (Figure 2), as well as a variety of expression 
vectors containing the ^.-opioid receptor, such as the human p-opioid receptor. 

[51] . Disclosed are vectors wherein the vector includes sequence encoding the p.- 
15 opioid receptor gene. Also disclosed are vectors, wherein a ji-opioid receptor gene has been 
cloned in an FIV vector. Disclosed are methods comprising administering the disclosed vectors 
to cells, including cells involved in transmitting pain signals, such as nerve cells in the orofacial 
regions, related to for example, pain from TMJ and the masseter muscle. 

[52] Also disclosed are transgenic mice that have been stably transfected with the 
20 disclosed vectors. These mice can be used, for example, as models of pain and the testing of 
therapeutics. 

f) Mouse model of experimental nociception 

[53] The majority of models evaluate reflex increases in jaw muscle activity, 
activating putative nociceptive pathways by the injection of algesic substances. Broton and 

25 Sessle (Boton JG, Sessle BJ, et aL, Arch Oral Biol 1988;33:741-47.) placed hypertonic saline, 
potassium chloride and histamine into the TMJ of cats and found increase in electromyographic 
(EMG) activity in the temporal, digastric and genioglossus muscles. This suggests that the 
products of tissue injury and or inflammation are capable of producing pain within the TMJ, 
causing excitation of sensory afferents and reflex muscle activity. Tambelietal. (TambeliCH, 

30 et aL, J Dent Res 1997; 76: (Special Issue) abstr# 1263.) injected mustard oil unilaterally into the 
TMJ and the results were similar to Broton and Sessle (Boton JG, Sessle BJ, et aL, Arch Oral 
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Biol 1988;33:741-47.). Increases in masticatory muscle activity have been also demonstrated by 
injection of excitatory amino acids into the TMJ area. Cairns et al. (Cairns BE, et al., J 
Neurosci 18;1998:8056-64. Cairns BE, et al., J Neurophvsiol 1999: 81: 1966-69.) injected 
glutamate into the TMJ area of rats and in a dose dependent manner induced a prolonged 
5 increase in EMG activity by the excitation of nociceptors (A delta and C fibers) (Cairns BE, et 
al., J Neurophvsiol 2001: 86: 782-90). Cairns etal. (Cairns BE, et al.. J Neurophvsiol 
85;2001 :2446-54.) placed glutamate into the masseter muscle of the rat and human and evoked 
higher masseter muscle activity in female than males of both species. These results indicate that 
peripheral insult can produce pain behavior and changes in resting muscle activity when 

10 algogenic substances are injected into the TMJ and masticatory muscles. Disclosed are 

behavioral models to assess orofacial pain from the TMJ and masseter muscle via application of 
glutamate into the TMJ and masticatory muscles. These models can include mice, disclosed 
herein, which have had vectors encoding ^-opioid receptor stably transduced. Resistance to jaw 
opening and EMG activity can serve as behavioral measures of pain. 

15 C. Compositions 

1. opioid receptors 

[54] There are typically considered three classes of opioid receptor ji, 8 and k. Genes 
encoding for these receptors have been cloned (Evans et al (1992) Science 258 1952; Kieffer et 
al (1992) Proc.Natl.Acad.Sci.USA 89 12048; Chen et al (1993) MoLPharmacol. 44 8; and 

20 Minami et al (1993) FEBS Lett 329 291 all of which are herein incorporated by reference for 
material related to opioid receptors and there sequence). In addition, an orphan receptor was 
identified which has a high degree of homology to the known opioid receptors and based on 
structural grounds it is considered a receptor called ORL1 (opioid receptor-like) (Mollereau et al 
(1994) FEBS Lett. 341 33, herein incorporated by reference for material related to opioid 

25 receptors and there sequence). Since the cloned receptors function as opioid receptors, by for 
example interacting with pertussis toxin-sensitive G-proteins, all of the cloned opioid receptors 
possess the same general structure which includes an extracellular N-terminal region, seven 
transmembrane domains and intracellular C-terminal tail structure. Evidence obtained from 
pharmacokinetic and activity data indicate there are subtypes of each receptor and other types, 

30 such as less well-characterized opioid receptors, such as e, \, i, which are known. One way 
of characterizing the different receptor subtypes for ji-, 5- and K-receptors is through different 
post-translational modifications of the gene product (glycosylation, palmitoylation, 
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phosphorylation, etc). Also receptor dimerization to form homomeric and heteromeric 
complexes or from interaction of the gene product with associated proteins such as. RAMPs can ' 
effect function, and thus represent another way to characterize the receptors. Different opioids 
have different affinity for the different opioid receptors. For example, /i-morphine, 5- 
leukenkephalin metenkephalin, /e-dynorphin, ^-endorphin, have different affinities for 
thevarious opioid receptors. 

a) ^-Receptor subtypes 

[55] The MOR-1 gene, encoding for one form of the ^-receptor, shows approximately 
50-70% homology to the genes encoding for the S-(DOR-l), k-(KOR-1) and orphan (ORL1) 
receptors. Two different splice variants of the MOR-1 gene have been cloned, and they differ 
by 8 amino acids in the C-terminal tail which are either present or not. The splice variants 
exhibit differences in their rate of onset and recovery from agonist-induced internalization but 
their pharmacology does not appear to differ in ligand binding assays. A MOR-1 knockout 
mouse has been made and the mouse does not respond to morphine, by failing to alleviate pain, 
and by failing to exhibit positive reinforcing properties or an ability to induce physical 
dependence in the absence of the MOR-1 gene. This indicates that at least in this species, 
morphine's analgesia is not mediated through 5- or K-receptors. (Matthes et al (1996) Nature 
383 818). 

[56] The p. receptor is divided into the fil and fil groups. The division occurs because 
of binding and pharmaco activity studies which indicate, for example, that naloxazone and 
naloxonazine abolish the binding of radioligands to the nl-site, and in vivo studies showed that 
naloxazone selectively blocked morphine-induced antinociception but did not block morphine- 
induced respiratory depression or the induction of morphine dependence, indicating different 
types of pi-receptor (Ling et al (1984) Science 226 462 and Ling et al (1985) 
JJ > harmacol.Exp.Ther. 232 149). Subsequent work in other laboratories has failed to confirm 
this classification. 

[57] Peptide sequences of the human and mouse \x receptor are set forth in SEQ ID 
Nos 1 and 3 respectively. 

[58] There is also data consistent with a third form of \i receptor where analogues of 
morphine with substitutions at the 6 position (e.g. morphine-6b-glucuronide, heroin and 6- 
acetjd morphine) are agonists, but with which morphine itself does not interact (Rossi et al 
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(1996) Neuroscience Letters 2161, herein incorporated by reference for material at least related 
to opioid receptors and their function and structure). Antinociception tests on mice show that 
morphine does not exhibit cross tolerance with morphine-6b-glucuronide, heroin or 6-acetyl 
morphine. Furthermore, in mice of the CXBX strain morphine is a poor antinociceptive agent 
5 whereas morphine-6b-glucuronide, heroin and 6-acetyl morphine are all potently 
antinociceptive. Heroin and morphine-6-glucuronide, but not morphine, still produce 
antinociception in MOR-1 knockout mice in which the disruption in the MOR-1 gene was 
engineered in exon-1 (Schuller et al (1999) Nature Neuroscience 2 151). Furthermore, all three 
agonists were ineffective as antinociceptive agents, in MOR-1 knockout mice in which exon-2, 
10 not exon-1, had been disrupted. This indicates that the antinociceptive actions of heroin and 
morphine-6-glucuronide in the exon-1 MOR-1 mutant mice are mediated through a receptor 
produced from an alternative transcript of the MOR-1 gene differing from the MOR-1 gene 
product, the ji-opioid receptor, in the exon-1 region. 

b) 5-Receptor subtypes 

15 [59] Only one 5-receptor gene has been cloned (DOR-1), but overlapping 

subdivisions of 8-receptor have been proposed (51/52 and 5cx/Sncx) on the basis of in vivo and 
in vitro pharmacological experiments. 

[60] The 5 receptor subclasses arise from pharmacological studies. Results from in 
vivo rodent studies are shown in Table 1. 

20 [61] Table 1. 





Agonist 


Competitive antagonist 


Non-competitive 
antagonist 


51 


DPDPE / DADLE 


BNTX (7- 

benzylidenenaltrexone) 


DALCE (p- 
Ala2, D- 

Leu5]enkephalyl- 
Cys) 


52 


Deltorphinll/ 
DSLET 


Naltriben 


5'-NTn 
(naltrindole 5'- 
isothiocyanate 



[62] There are a number of different ligands for the opioid receptors which 



differentially bind one or more receptors. Examples of these ligands are shown in Table 2. 



Receptor type 


^.-Receptor 


5 -Receptor 


k -Receptor 


ORL, 
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Selective agonists 


endomorphin- 

lendomoiphin- 

2DAMGO 


[D-Ala 2 ]- 
deltoiphin I 
[D-Ala 2 ]- 
deltorphin II 
DPDPE 
SNC 80 


enadoline 

U-50488 

U-69593 


nociceptin / 
OFQAc- 
RYYRWK- 
NH 2 * 


Selective 

antagonists 

CTAP 
naltrindole 

UPP 


ylCI 174864 
nor 


binaltorphimine 


Selective 

antagonists 

CTAP 
naltrindole 

TIPP 


None as yet** 


Radioligands 


[3H] 


DAMGO 

[3H] 


[ 3 H] -enadoline 
[ 3 H]-U69593 


nociceptin 



[63] Table 2 



[64] The pharmacological properties of the cloned DOR-1 receptor are somewhere 
between those predicted for either the 81 or 82 subtypes. Mouse and human recombinant 
receptors both bind DPDPE and deltorphin II, which can displacer of [3H]-diprenorphine. This 
5 is different than either a 81 or 82 classification (Law et al (1994) J.Pharmacol.Exp.Ther. 271 
1686). [3H]-diprenorphine, binding to the mouse recombinant receptor, however, is more highly 
displaced by naltriben than BNTX, consistent with it being 82 like. 

[65] Opioid receptors have also been indicated to be in complex ji-receptors and k- 
receptors. For example, one type of 8 receptor subtypes complexes, Sex, and another appears 
10 not to complex, 8ncx (Rothman et al (1993) In: Handbook Exp.Phannacol. Ed. A. Herz 104/1 
p217). 

c) K-Receptor 

[66] The cloned K-Receptor has the sequence set forth in SEQ ID NO: 5, which 
represents an example of a /c-receptor. 

15 d) The orphan opioid receptor 

[67] The orphan receptor has been identified hi three species: rat, mouse and man, all 
having a greater than 90% identity with each other. This receptor is typically referred to as 
ORL-1 for orphan receptor like 1. The endogenous peptide agonist for ORL1 is known as 1 
nociceptin or orphanin FQ. While the ORL1 receptor has structural homology to oiphan 
20 receptors it does not have.pharmacological homology. Non-selective ligands that exhibit high 
affinity for all fi-, k- and 8-receptors, have very low affinity for the ORL1 receptor. Comparison 
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of the deduced amino-acid sequences of the four receptors highlights structural differences 
consistent with the lack of coligand binding. The trans-membrane regions are conserved near 
their top in the p.-, k- and 6-receptors, but are altered in ORLL Site-directed mutants of ORL1 
towards the traditional receptors (rat) are able to bind the traditional receptor's ligands. For 
example, benzomorphan bremazocine binds ORL1 by changing Ala213 in TM5 to the 
conserved Lys of m k and 8, or by changing the Val-Gln-Val276-278 sequence of TM6 to the 
conserved fle-His-Ile motif (Meng et al (1996) IBiol.Chem. 271 32016). There are also a 
number of splice variants of the ORL1 receptor, XOR (Wang et al (1994) FEBS Lett. 348 75) 
with a long form (XOR1L) containing an additional 28 amino acids in the third extracellular 
loop and in the homologous receptor from mouse, KOR-3, five splice variants have been 
reported to date (Pan et al (1998) FEBS Lett. 435 65). 

e) Endogenous Ligands 

[68] In mammals the endogenous opioid peptides are mainly derived from four 
precursors: proopiomelanocortin, pro-enkephalin, pro-dynorphin and pro-nociceptin/orphanin 
FQ. Nociceptin/orphanin FQ is processed from pro-nociceptin/orphanin FQ and is the 
endogenous ligand for the ORLl-receptor; it has little affinity for the n~, 5- and K-receptors. 
Table 3 sets forth endogenous ligands for the opioid receptors. These peptides bind \i, S- and k- 
receptors with different affinity, and have negligible affinity for ORLl-receptors, but none binds 
exclusively to one opioid receptor type. 0-endorphin is equiactive at p.-and 8-receptors with 
much lower affinity for K-receptors; the post-translational product, N-acetyl-p-endorphin, has 
very low affinity for any of the opioid receptors. [Met]- and [Leu]enkephalin have high 
affinities for 5-receptors, ten-fold lower affinities for ^-receptors and negligible affinity for k- 
receptors. Other products of processing of pro-enkephalin, which are N-terminal extensions of 
[Met]enkephalin, have a decreased preference for the 5-receptor with some products, e.g. 
metorphamide displaying highest affinity for the fi-receptor. The opioid fragments of pro- 
dynorphin, particularly dynorphin A and dynorphin B, have high affinity for K-receptors but also 
have significant affinity for p.- and 5-receptors. 

[69] Endomorphin-1 and endomorphin-2 are putative products of an as yet 
unidentified precursor, that have been proposed to be the endogenous ligands for the fi-receptor 
where they are highly selective. The endomorphins are amidated tetrapeptides and are 
structurally unrelated to the other endogenous opioid peptides (Table 3). Although the study of 
the cellular localization of these peptides is at an early stage, endomorphin-2 is found in discrete 
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regions of rat brain, some of which are known to contain high concentrations of ^-receptors. 
Endomorphin-2 is also present in primary sensory neurones and the dorsal horn of the spinal 
cord where it could function to modulate nociceptive input 

[70] In comparison to the mainly non-selective mammalian opioid peptides (the 
5 exceptions being the endomorphins), amphibian skin contains two families of D-amino acid- 
containing peptides that are selective for p.- or 5-receptors. Dermorphin is a ^-selective 
heptapeptide Tyr-D-Ala-Phe-Gly-Tyr-Pro-Ser-NH2 without significant affinity at d- and k- 
receptors. In contrast, the deltorphins - deltorphin (dermenkephalin; Tyr-D-Met-Phe-His-Leu- 
Met-Asp-NH2), [D-Ala2]-deltoiphin I and [D-Ala2]-deltorphin H (Tyr-D-Ala-Phe-Xaa-Val- 
10 Val-Gly-NH2, where Xaa is Asp or Glu respectively) - are highly selective for 5-opioid 
receptors. Table 3 shows a variety of endogenous opioid receptor molecules. 



[71] Table 3. 



Precursor 


Endogenous peptide 


Amino acid sequence 


Pro-opiomelanocortin 


P-Endorphin 


YGGFMTSEKSQTPLVTL- 
FKNAHKNAYKKGE 


Pro-enkephalin 


[Met]enkephalin 
[Leu]enkephalin 

Metorphamide 


YGGFM . 

YGGFL 

YGGFMRF 

YGGFMRGL 

YGGFMRRV-NH 2 


Pro-dynorphin 


Dynorphin A 
Dynorphin A(l-8) 
Dynorphin B 
a-neoendorphin 
P-neoendorphin 


YGGFLRRIRPKLKWDNQ 

YGGFLRRI 

YGGFLRRQFKWT 

YGGFLRKYPK 

YGGFLRKYP 


Pro-nociceptin / OFQ 


Nociceptin 


FGGFTGARKSARKLANQ 


Pro-endomorphin* 


Endomorphin-lEndomorphin- 
2 


YPWF-NH 2 
YPFF-NH 2 



[72] Opioid receptor activation produces a wide array of cellular responses (Table 4). 
For example, there are Direct G-protein bg or a subunit-mediated effects, such as activation of 



15 an inwardly rectifying potassium channel, inhibition of voltage operated calcium channels (N, P, 
Q and R type), inhibition of adenylyl cyclase, Responses of unknown intermediate mechanism, 
activation of PLA2, activation of PLC b (possibly direct G protein bg subunit activation), 
activation of MAPKinase, activation of large conductance calcium-activated potassium 
channels, activation of L type voltage operated calcium channels, inhibition of T type voltage 

20 operated calcium channels, and direct inhibition of transmitter exocytosis. There are also 
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responses in other effector pathways, such as activation of voltage-sensitive potassium channels 
(activation of PLA2), inhibition of M channels (activation of PLA2), inhibition of the 
hyperpolarisation-activated cation channel (Ih) (reduction in cAMP levels following inhibition 
of adenylyl cyclase), elevation of intracellular free calcium levels (activation of PLCb, 
5 activation of L type voltage operated calcium conductance), potentiation of NMDA currents 
(activation of protein kinase C), inhibition of transmitter release (inhibition of adenylyl cyclase, 
activation of potassium channels and inhibition of voltage operated calcium channels), 
decreases in neuronal excitability (activation of potassium channels), increases in neuronal 
firing rate (inhibition of inhibitory transmitter release - disinhibition), and changes in gene 
10 expression (long-term changes in adenylyl cyclase activity, elevation of intracellular calcium 
levels, activation of cAMP response element binding protein (CREB)). 

2. Compositions for treating pain 

[73] Disclosed are constructs and vectors for expressing one or more opioid receptors 
in a cell, such as a nerve cell, such as a peripheral nerve cell. As discussed herein, opioid 

1 5 receptors are typically expressed in the spinal or supraspinal nerve cells, and the periphery. 

typically do not express these receptors. The disclosed compositions and methods are designed 
to express the opioid receptors in nerve cells which are damaged or transmitting because of 
trauma, but which do not have endogenous opioid receptors or insufficient numbers of 
endogenous receptors to react to the endogenous opioid like molecules, typically in the 

20 periphery of the nerve cell. Thus, the expression of the opioid receptors in the nerve cell near the 
point of pain, will typically increase the amount of opioid receptors in this area and thus, 
increase the responsiveness to endogenous opioid like molecules. By expression of the opioid 
receptors, the sensation of pain can be reduced, not by administration of opioid analgesics, but 
rather by more efficient use of endogenous opioid like compounds. It is understood, however, 

25 that opioids, opioid like molecules, and/br other pain alleviating molecules can be added in 
addition to the disclosed opioid receptors. 

[74] Disclosed are methods wherein administration occurs in the intra-articular region 
of the jaw. The results shown herein demonstrated that intra-articular injection of FIV(lacZ) 
resulted in successful gene transfer to articular TMJ surfaces as well as the joint meniscus. 
30 Thus, disclosed are methods, wherein the administration of the disclosed vectors, results in 
delivery to the articular TMJ surfaces and the joint meniscus. 
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[75] Nociceptive innervation to the temporomandibular joint (TMJ) is primarily 
provided by the auriculotemporal nerve of the mandibular division of the trigeminal nerve 
(Sessle & Wu, 1991). A5 and C nerve fibers, whose cell bodies are located in the posterolateral 
part of the trigeminal ganglion (Yoshino et aL, 1998), project distally and terminate as non- 
5 encapsulated free nerve endings dispersed throughout the posterolateral part of the TMJ capsule 
(Bernick, 1962; Thilander, 1964; Frommer & Monroe, 1966; Klineberg, 1971), the posterior 
band of the meniscus and the posterior attachment (Dressen et aL, 1990; Kido et aL, 1991, 1993; 
Wink et aL, 1992). Transfer of antinociceptive genes to sensory trigeminal neurons innervating 
the orofacial region can be achieved after injection of lentiviral vectors at the painful site, such 
10 as the TMJ, resulting in their uptake by free nerve endings and retrograde transport to the 

sensory cells' nuclei. Previous studies demonstrated axonal retrograde transport of horseradish 
peroxidase from the TMJ to the central nervous system (Romfh et aL, 1 979; Carpa, 1 987) 
including the trigeminal ganglia (Y oshino et aL, 1 998). 

[76] Disclosed are constructs capable of expressing any of the opioid receptor gene 
15 products. 

[77] Disclosed are constructs capable of expressing opioid receptors, such as the jx- 
opioid receptor gene product. The fx-opioid receptor construct allows for synthesis of jx-opioid 
receptor protein. 

[78] The fi-opioid receptor construct typically comprises three parts: 1) a promoter, 2) 
20 the ^i-opioid receptor coding sequence, and 3) polyA tail. The poly A tail can be from the 

bovine growth hormone or any polyA tail including synthetic poly A tails. The Bovine growth 
hormone poly A tail carries elements that not only increase expression, but also increase 
stability of any gene construct These three parts can be integrated into any vector delivery 
system, which is capable of transducing terminally differentiated cells, such as nerve cells. 

25 [79] The promoter can be any promoter, such as those discussed herein. It is 

understood as discussed herein that there are functional variants of opioid receptors, such as the 
^.-opioid receptor protein which can be made. In certain embodiments the promoter is going to 
be a cell specific promoter, such as a nerve cell specific promoter, such as the neuron specific 
enolase promoter. Other promoters are disclosed herein. 

30 [80] The promoter can be any promoter, such as those discussed herein. It is 

understood as discussed herein that there are functional variants of opioid receptors, such as the 
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ji-opioid receptor protein which can be made. In certain embodiments the promoter is going to 
be a cell specific promoter, such as a nerve cell specific promoter, such as the neuron specific 
enolase promoter. 

[8 1 ] ^-opioid receptor cDNA can be obtained from the American Tissue Culture 
5 Collection. (American Tissue Culture Collection, Manassas, VA 201 10-2209; n-opioid receptor 
ATCC#. RaynorK, etal., Characterization of the cloned human mu opioid receptor. J 
Pharmacol Exp Ther. 1995;272:423-8.) 

[82] Also disclosed are constructs encoding for the human or mouse ^.-opioid 
receptor, as well as the (3-galactosidase reporter gene (lacZ). 

10 [83] Disclosed are nucleic acids comprising sequence encoding ^i-opioid receptor. 

Also disclosed are nucleic acids, wherein the nucleic acid further comprises a promoter 
sequence, wherein the ^-opioid receptor has at least 80% identity to the sequence set forth in 
SEQ ID NO:2 or 4, wherein the ji-opioid receptor has at least 85% identity to the sequence set 
forth in SEQ ID NO: 1 or 3, wherein the ^-opioid receptor has at least 90% identity to the 

15 sequence set forth in SEQ ID NO: 1 or 3, wherein the ji-opioid receptor has at least 95% identity 
to the sequence set forth in SEQ ID NO: 1 or 3, and/or wherein the ^-opioid receptor has the 
sequence set forth in SEQ ID NO: 1 or 3. 

[84] Also disclosed are vectors comprising the disclosed nucleic acids. Also 
disclosed are cells comprising the disclosed nucleic acids and vectors. Any cell can be targeted 
20 with the disclosed constructs. However, nerve cells, for example, are terminally differentiated. 
This means that they are no longer dividing. The state of a mature non-dividing nerve cell can 
define terminally differentiated cells. In terms of differentiated\stable transduction, nerve cells 
thus represent attractive targets because once DNA is integrated, there is a very low probability 
that it will not remain in the cell. 

25 [85] Also disclosed are non-human mammals comprising the disclosed nucleic acids, 

vectors, and cells disclosed herein. 

[86] Also disclosed are methods of providing ^.-opioid receptor in a cell comprising 
transfecting the cell with the nucleic acids. 

[87] Also disclosed are method of delivering the disclosed compositions, wherein the 
30 transfection occurs in vitro or in vivo. 
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[88] Disclosed are methods of making a transgenic organism comprising 
administering the disclosed nucleic acids, vectors and/or cells. 

[89], Disclosed are methods of making a transgenic organism comprising transfecting 
a lentiviral vector to the organism at during a perinatal stage of the organism's development. 
5 Stragtegies of producing genetically engineered pluripotent, such as embryonic, stem cells, can 
be performed with the disclosed compositions to produce engineered cells and organisms as 
dicussed herein. Furthermore by cloning strategies can be used to produce desried organisms, 
which cany one or more of the disclosed compositions. 

[90] Also disclosed are methods of treating a subject having pain comprising 
10 administering any of the disclosed compounds and compositions. 

[9 1 ] Delivery of the disclosed constructs to terminally differentiated cells is also 
disclosed. Disclosed is a pseudotyped feline immunodeficiency virus (FIV) for p-opioid 
receptor delivery to terminally differentiated cells. Stable expression of the therapeutic gene 
aids prolonged expression, enhancing treatment efficacy and contributing to long-term 
1 5 therapeutic outcomes. The backbone FIV system has been shown to effectively incorporate, due 
to its lentiviral properties, the transgene of interest into the host's genome, allowing for stable 
gene expression (Poeschla et al., 1998). Disclosed herein is stable expression of the reporter 
gene lacZ in N2a cells, following perinatal systemic FlV(lacZ) administration. 

[92] hi certain embodiments the constructs become an integrated product with the 
20 genome of the host. For example, lentiviruses, such as HTV and LIV, have the characteristic of 
transfecting the therapeutic gene into the host chromosome, thus forming an integrated product. 
In certain embodiments, the requirement is that the vectors allow for expression in the 
periphery of the cell, such as the nerve cell, and/or at or near the point of pain. The contrast to 
integrated products is episomal products which can also be produced using, for example, HSV 
25 and AV vectors. Thus, transient expression can be beneficial. The optimal time of expression 
is correlated with the amount of product produced and amount that is needed. For example, in 
certain embodiments, expression for at least 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30, 45, 60, 90, 120, 
1 50, or 1 80 days is desirable. 

[93] A model system for the study of these vectors is a mouse that is knockout mouse 
30 deficient in ^-opioid receptor. 
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[94] Stable transduced are cells where a nucleic acid has been integrated into the cell 
genome. 

3. Delivery of the compositions to cells 

[95] Delivery can be applied, in general, via local or systemic routes of 
administration. Local administration includes virus injection directly into the region or organ of 
interest, versus intravenous (IV) or intraperitoneal (TP) injections (systemic) aiming at viral 
delivery to multiple sites and organs via the blood circulation. Previous research on the effects 
of local administration demonstrated gene expression limited to the site/organ of the injection, 
which did not extend to the rest of the body (Daly et al., 1999a; Kordower et al., 1999). 
Furthermore, previous studies have demonstrated successful global gene transfer to multiple 
tissues and organs in rodents and primates following viral IV and IP injections (Daly et al., 
1999b; Tarntal et al., 2001; McCormack et al., 2001; Lipschutz et al., 2001). Disclosed herein 
IP injection of FIV(lacZ) in mice of adult (3 months old) as well as of perinatal age (P4) 
resulted in global transfer and expression of the reporter gene lacZ in brain, liver, spleen and 
kidney. Also disclosed, the levels of expression achieved via IP injections were superior to 
those acquired following local administration directly into the liver. 

[96] There are a number of compositions and methods which can be used to deliver 
nucleic acids to cells, either in vitro or in vivo. These methods and compositions can largely be 
broken down into two classes: viral based delivery systems and non-viral based delivery 
systems. For example, the nucleic acids can be delivered through a number of direct delivery 
systems such as, electroporation, lipofection, calcium phosphate precipitation, plasmids, viral 
vectors, viral nucleic acids, phage nucleic acids, phages, cosmids, or via transfer of genetic 
material in cells or carriers such as cationic liposomes. Appropriate means for transfection, 
including viral vectors, chemical transfectants, or physico-mechanical methods such as 
electroporation and direct diffusion of DNA, are described by, for example, Wolff, J. A., et al., 
Science, 247, 1465-1468, (1990); and Wolf£ J. A. Nature, 352, 815-818,(1991). Such 
methods are well known in the art and readily adaptable for use with the compositions and 
methods described herein. In certain cases, the methods will be modified to specifically 
function with large DNA molecules. Further, these methods can be used to target certain 
diseases and cell populations by using the targeting characteristics of the carrier. 
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a) Nucleic acid based delivery systems 

[97] Transfer vectors can be any nucleotide construction used to deliver genes into 
cells (e.g., a plasmid), or as part of a general strategy to deliver genes, e.g., as part of 
recombinant retrovirus or adenovirus (Ram et al. Cancer Res. 53:83-88, (1993)). 

5 [98] As used herein, plasmid or viral vectors are agents that transport the disclosed 

nucleic acids, such as the ^-opioid receptor construct into the cell without degradation and 
include a promoter yielding expression of the |!-opioid receptor encoding sequences in the cells 
into which it is delivered. In some embodiments the vectors for the ji-opioid receptor constructs 
are derived from either a virus, retrovirus,* or lentivirus. Viral vectors can be, for example, 

10 Adenovirus, Adeno-associated virus, Herpes virus, Vaccinia virus, Polio virus, AIDS virus, 
neuronal trophic virus, Sindbis and other RNA viruses, including these viruses with the HIV 
backbone, and lentiviruses. Also preferred are any viral families which share the properties of 
these viruses which make them suitable for use as vectors. Retroviruses include Murine 
Maloney Leukemia virus, MMLV, and retroviruses that express the desirable properties of 

15 MMLV as a vector. Retroviral vectors are able to carry a larger genetic payload, i.e., a 

transgene, such as, the disclosed ^.-opioid receptor constructs or marker gene, than other viral 
vectors, and for this reason are a commonly used vector. However, they are not as useful in 
non-proliferating cells. Adenovirus vectors are relatively stable and easy to work with, have 
high titers, and can be delivered in aerosol formulation, and can transfect non-dividing cells. 

20 Pox viral vectors are large and have several sites for inserting genes, they are thermostable and 
can be stored at room temperature. A preferred embodiment is a viral vector, which has been 
engineered so as to suppress the immune response of the host organism, elicited by the viral 
antigens. Preferred vectors of this type will carry coding regions for Interleukin 8 or 10. 

[99] Viral vectors can have higher transaction (ability to introduce genes) abilities 
25 than chemical or physical methods to introduce genes into cells. Typically, viral vectors 
contain, nonstructural early genes, structural late genes, an RNA polymerase III transcript, 
inverted terminal repeats necessary for replication and encapsidation, and promoters to control 
the transcription and replication of the viral genome. When engineered as vectors, viruses 
typically have one or more of the early genes removed and a gene or gene/promotor cassette is 
30 inserted into the viral genome in place of the removed viral DNA. Constructs of this type can 
carry up to about 8 kb of foreign genetic material. The necessary functions of the removed early 
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genes are typically supplied by cell lines which have been engineered to express the gene 
products of the early genes in trans. 

(1) Retroviral Vectors 

[100] A retrovirus is an animal virus belonging to the virus family of Retroviridae, 
5 including any types, subfamilies, genus, or tropisms. Retroviral vectors, in general, are 
described by Verma, I.M., Retroviral vectors for gene transfer. In Microbiology-1985, 
American Society for Microbiology, pp. 229-232, Washington, (1985), which is incorporated 
by reference herein. Examples of methods for using retroviral vectors for gene therapy are 
described in U.S. Patent Nos. 4,868,1 16 and 4,980,286; PCT applications WO 90/02806 and 
10 WO 89/07136; and Mulligan, (Science 260:926-932 (1993)); the teachings of which are 
incorporated herein by reference. 

[101] A retrovirus is essentially a package which has packed into it nucleic acid cargo. 
The nucleic acid cargo carries with it a packaging signal, which ensures that the replicated 
daughter molecules will be efficiently packaged within the package coat. In addition to the 

15 package signal, there are a number of molecules which are needed in cis, for the replication, and 
packaging of the replicated virus. Typically a retroviral genome, contains the gag, pol, and env 
genes which are involved in the making of the protein coat It is the gag, pol, and env genes 
which are typically replaced by the foreign DNA that it is to be transferred to the target cell. 
Retrovirus vectors typically contain a packaging signal for incorporation into the package coat, a 

20 sequence which signals the start of the gag transcription unit, elements necessary for reverse 
transcription, including a primer binding site to bind the tRNA primer of reverse transcription, 
terminal repeat sequences that guide the switch of RNA strands during DNA synthesis, a purine 
rich sequence 5 1 to the 3' LTR that serve as the priming site for the synthesis of the second 
strand of DNA synthesis, and specific sequences near the ends of the LTRs that enable the 

25 insertion of the DNA state of the retrovirus to insert into the host genome. The removal of the 
gag, pol, and env genes allows for about 8 kb of foreign sequence to be inserted into the viral 
genome, become reverse transcribed, and upon replication be packaged into a new retroviral 
particle. This amount of nucleic acid is sufficient for the delivery of a one to many genes 
depending on the size of each transcript. It is preferable to include either positive or negative 

30 selectable markers along with other genes in the insert. 

[102] Since the replication machinery and packaging proteins in most retroviral vectors 
have been removed (gag, pol, and env), the vectors are typically generated by placing them into 
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a packaging cell line. A packaging cell line is a cell line which has been transfected or 
transformed with a retrovirus that contains the replication and packaging machinery, but lacks 
any packaging signal. When the vector carrying the DNA of choice is transfected into these cell 
lines, the vector containing the gene of interest is replicated and packaged into new retroviral 
5 particles, by the machinery provided in cis by the helper cell. The genomes for the machinery 
are not packaged because they lack the necessary signals. 

(2) Adenoviral Vectors 

[1 03] The construction of replication-defective adenoviruses has been described 
(Berkneretal., J. Virology 61:1213-1220 (1987); Massie etal., Mol. Cell. Biol. 6:2872- 

10 2883 (1986); Haj-Ahmad et at, J. Virology 57:267-274 (1986); Davidson et aL, J. Virology 
61:1226-1239 (1987); Zhang "Generation and identification of recombinant adenovirus by 
liposome-mediated transfection and PCR analysis" BioTechniques 15:868-872(1993)). The 
benefit of the use of these viruses as vectors is that they are limited in the extent to which they 
can spread to other cell types, since they can replicate within an initial infected cell, but are 

15 unable to form new infectious viral particles. Recombinant adenoviruses have been shown to 
achieve high efficiency gene transfer after direct, in vivo delivery to airway epithelium, 
hepatocytes, vascular endothelium, CNS parenchyma and a number of other tissue sites (Morsy, 
J. Clin. Invest 92:1580-1586 (1993); Kirshenbaum, J. Clin. Invest 92:381-387(1993); 
Roessler, J. Clin. Invest. 92:1085-1092 (1993); Moullier, Nature Genetics 4:154-159 (1993); 

20 La Salle, Science 259:988-990 (1993); Gomez-Foix, J. Biol. Chem. 267:25129-25134 

(1992) ; Rich, Human Gene Therapy 4:461-476 (1993); Zabner, Nature Genetics 6:75-83 
(1994); Guzman, Circulation Research 73:1201-1207 (1993); Bout, Human Gene Therapy 5:3- 
10 (1994); Zabner, Cell 75:207-216 (1993); Caillaud, Eur. J. Neuroscience 5:1287-1291 

(1993) ; and Ragot, J. Gea Virology 74:501-507 (1993)). Recombinant adenoviruses achieve 
25 gene transduction by binding to specific cell surface receptors, after which the virus is 

internalized by receptor-mediated endocytosis, in the same manner as wild type or replication- 
defective adenovirus (Chardonnet and Dales, Virology 40:462-477 (1970); Brown and 
Burlingham, J. Virology 12:386-396 (1973); Svensson and Persson, J. Virology 55:442-449 
(1985); Seth, etal., J. Virol. 51:650-655 (1984); Seth, etal., Mol. Cell. Biol. 4:1528-1533 
30 (1984); Varga et aL, J. Virology 65:6061-6070 (1991); Wickham et at, Cell 73:309-3 19 
(1993)). 
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[104] A viral vector can be one based on an adenovirus which has had the El gene 
removed and these virions are generated in a cell line such as the human 293 cell line. In 
another preferred embodiment both the El and E3 genes are removed from the adenovirus 
genome. 

5 (3) Adeno-asscociated viral vectors 

[105] Another type of viral vector is based on an adeno-associated virus (AAV). This 
defective parvovirus is a preferred vector because it can infect many cell types and is 
nonpathogenic to humans. AAV type vectors can transport about 4 to 5 kb and wild type AAV 
is known to stably insert into chromosome 19. Vectors which contain this site specific 
1 0 integration property are preferred. An especially preferred embodiment of this type of vector is 
the P4. 1 C vector produced by Avigen, San Francisco, CA, which can contain the herpes 
simplex virus thymidine kinase gene, HSV-tk, and/or a marker gene, such as the gene encoding 
the green fluorescent protein, GFP. 

[106] In another type of AAV virus, the AAV contains a pair of inverted terminal 
15 repeats (ITRs) which flank at least one cassette containing a promoter which directs cell-specific 
expression operably linked to a heterologous gene. Heterologous in this context refers to any 
nucleotide sequence or gene which is not native to the AAV or B 1 9 parvovirus. 

[107] Typically the AAV and B19 coding regions have been deleted, resulting in a safe, 
noncytotoxic vector. The AAV ITRs, or modifications thereof, confer infectivity and site- 
20 specific integration, but not cytotoxicity, and the promoter directs cell-specific expression. 
United states Patent No. 6,261,834 is herein incorporated by reference for material related to 
the AAV vector. 

[108] The vectors of the present invention thus provide DNA molecules which are 
capable of integration into a mammalian chromosome without substantial toxicity. 

25 [109] The inserted genes in viral and retroviral usually contain promoters, and/or 

enhancers to help control the expression of the desired gene product A promoter is generally a 
sequence or sequences of DNA that function when in a relatively fixed location in regard to the 
transcription start site. A promoter contains core elements required for basic interaction of 
RNA polymerase and transcription factors, and can contain upstream elements and response 

30 elements. 

(4) Lentiviral vectors 
— 30 — 
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[01 ] The vectors can be lentiviral vectors, including but not limited to, SIV vectors, 
HIV vectors or a hybrid construct of these vectors, including viruses with the HIV backbone. 
These vectors also include first, second and third generation lentiviruses. Third generation 
lentiviruses have lentiviral packaging genes split into at least 3 independent plasmids or 
constructs. Also vectors can be any viral family that shares the properties of these viruses which 
make them suitable for use as vectors. Lentiviral vectors are a special type of retroviral vector 
which are typically characterized by having a long incubation period for infection. Furthermore, 
lentiviral vectors can infect non-dividing cells. Lentiviral vectors are based on the nucleic acid 
backbone of a virus from the lentiviral family of viruses. Typically, a lentiviral vector contains 
the 5 f and 3 1 LTR regions of a lentivirus, such as SIV and HIV. Lentiviral vectors also typically 
contain the Rev Responsive Element (RRE) of a lentivirus, such as SIV and HIV. 

(a) Feline immunodeficiency viral vectors 

[110] One type of vector that the disclosed constructs can be delivered in is the VS V-G 
pseudotyped Feline Immunodeficiency Virus system developed by Poeschla a/. (1998). This 
lentivirus has been shown to efficiently infect dividing, growth arrested as well as post-mitotic 
cells. Furthermore, due to its lentiviral properties, it allows for incorporation of the transgene 
into the host's genome, leading to stable gene expression. This is a 3^vector system, whereby 
each confers distinct instructions: the FIV vector carries the transgene of interest and lentiviral 
apparatus with mutated packaging and envelope genes. A vesicular stomatitis virus G- 
glycoprotein vector (VSV-G; Burns et al., 1993) contributes to the formation of the viral 
envelope in trans. The third vector confers packaging instructions in trans (Poeschla et al 9 
1998). FIV production is accomplished in vitro following co-transfection of the aforementioned 
vectors into 293-T cells. The FIV-rich supernatant is then collected, filtered and can be used 
directly or following concentration by centrifugation. Titers routinely range between 10 4 - 10 7 
bfu/ml.. 

(5) Packaging vectors 

[111] As discussed above, retroviral vectors are based on retroviruses which contain a 
number of different sequence elements that control things as diverse as integration of the virus, 
replication of the integrated virus, replication of un-integrated virus, cellular invasion, and 
packaging of the virus into infectious particles. While the vectors in theoiy could contain all of 
their necessary elements, as well as an exogenous gene element (if the exogenous gene element 
is small enough) typically many of the necessary elements are removed. Since all of the 
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packaging and replication components have been removed from the typical retroviral, including 
lentiviral, vectors which will be used within a subject, the vectors need to be packaged into the 
initial infectious particle through the use of packaging vectors and packaging cell lines. 
Typically retroviral vectors have been engineered so that the myriad functions of the retrovirus 
5 are separated onto at least two vectors, a packaging vector and a delivery vector. This type of 
system then requires the presence of all of the vectors providing all of the elements in the same 
cell before an infectious particle can be produced. The packaging vector typically carries the 
structural and replication genes derived from the retrovirus, and the delivery vector is the vector 
that carries the exogenous gene element that is preferably expressed in the target cell. These 
10 types of systems can split the packaging functions of the packaging vector into multiple vectors, 
e.g., third-generation lentivirus systems. Dull, T. et al., "A Third-generation lentivirus vector 
with a conditional packaging system" J. Virol 72(1 1):8463-71 (1998) 

[112] Retroviruses typically contain an envelope protein (env). The Env protein is in 
essence the protein which surrounds the nucleic acid cargo. Furthermore cellular infection 
15 specificity is based on the particular Env protein associated with a typical retrovirus. In typical 
packaging vector/delivery vector systems, the Env protein is expressed from a separate vector 
than for example the protease (pro) or integrase (in) proteins. 

(6) Packaging cell lines 

[113] The vectors are typically generated by placing them into a packaging cell line. A 
20 packaging cell line is a cell line which has been transfected or transformed with a retrovirus that 
contains the replication and packaging machinery, but lacks any packaging signal. When the 
vector carrying the DNA of choice is transfected into these cell lines, the vector containing the 
gene of interest is replicated and packaged into new retroviral particles, by the machinery 
provided in cis by the helper cell. The genomes for the machinery are not packaged because 
25 they lack the necessary signals. One type of packaging cell line is a 293 cell line. 

(7) Large payload viral vectors y 

[114] Molecular genetic experiments with large human herpesviruses have provided a 
means whereby large heterologous DNA fragments can be cloned, propagated and established in 
cells permissive for infection with herpesviruses (Sun et al., Nature genetics 8: 33-41, 1994; 
30 Cotter and Robertson. Curr Opin Mol Ther 5: 633-644, 1999). These large DNA viruses 
(herpes simplex virus (HS V) and Epstein-Barr virus (EBV), have the potential to deliver 
fragments of human heterologous DNA > 1 50 kb to specific cells. EBV recombinants can 
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maintain large pieces of DNA in the infected B-cells as episomal DNA. Individual clones 
carried human genomic inserts up to 330 kb appeared genetically stable The maintenance of 
these episomes requires a specific EBV nuclear protein, EBNA1, constitutively expressed 
dining infection with EBV. Additionally, these vectors can be used for transfection, where large 
5 amounts of protein can be generated transiently in vitro. Herpesvirus amplicon systems are also 
being used to package pieces of DNA > 220 kb and to infect cells that can stably maintain DNA 
as episomes. 

[115] Other useful systems include, for example, replicating and host-restricted non- 
replicating vaccinia virus vectors. 

10 b) Non-nucleic acid based systems 

[116] The disclosed compositions can be delivered to the target cells in a variety of 
ways. For example, the compositions can be delivered through electroporation, or through 
lipofection, or through calcium phosphate precipitation. The delivery mechanism chosen will 
depend in part on the type of cell targeted and whether the delivery is occurring for example in 
15 vivo or in vitro. 

[117] Thus, the compositions can comprise, in addition to the disclosed constructs or 
vectors for example, lipids such as liposomes, such as cationic liposomes (e.g., DOTMA, 
DOPE, DC-cholesterol) or anionic liposomes. Liposomes can further comprise proteins to 
facilitate targeting a particular cell, if desired. Administration of a composition comprising a 

20 compound and a cationic liposome can be administered to the blood afferent to a target organ or 
inhaled into the respiratory tract to target cells of the respiratory tract Regarding liposomes, 
see, e.g., Brigham et al. Am. J. Resp. Cell Mol Biol 1:95-100 (1989); Feigner etal. Proc. 
Natl Acad. Sci USA 84:7413-7417 (1987); U.S. Pat No.4,897,355. Furthermore, the 
compound can be administered as a component of a microcapsule that can be targeted to 

25 specific cell types, such as macrophages, or where the diffusion of the compound or delivery of 
the compound from the microcapsule is designed for a specific rate or dosage. 

[118] In the methods described above which include the administration and uptake of 
exogenous DNA into the cells of a subject (i.e., gene transduction or transfection), delivery of 
the compositions to cells can be via a variety of mechanisms. As one example, delivery can be 
30 via a liposome, using commercially available liposome preparations such as LIPOFECTIN, 
LffOFECT AMINE (GIBCO-BRL, Inc., Gaithersburg, MD), SUPERFECT (Qiagen, Inc. 
Hilden, Germany) and TRANSFECTAM (Promega Biotec, Inc., Madison, WI), as well as other 
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liposomes developed according to procedures standard in the art. In addition, the nucleic acid or 
vector of this invention can be delivered in vivo by electroporation, the technology for which is 
available from Genetronics, Inc. (San Diego, CA) as well as by means of a SONOPORAHON 
machine (ImaRx Pharmaceutical Corp., Tucson, AZ). 

5 [H9] The materials can be in solution, suspension (for example, incorporated into 

microparticles, liposomes, or cells). These can be targeted to a particular cell type via 
antibodies, receptors, or receptor ligands. The following references are examples of the use of 
this technology to target specific proteins to tumor tissue (Senter, et al., Bioconiugate Chem.. 
2:447-451, (1991); Bagshawe, K.D., Br. J. Cancer. 60:275-281, (1989); Bagshawe, et al., Br, 

10 J. Cancer. 58:700-703, (1988); Senter, et al, Bioconiugate Chem.. 4:3-9, (1993); Battelli, et al., 
Cancer Immunol. Immunother.. 35:421-425, (1992); Pietersz and McKenzie , Immunolog. 
Reviews. 129:57-80, (1992); and Roffler, et al., Biochem. Pharmacol 42:2062-2065, (1991)). 
These techniques can be used for a variety of other specific cell types. Vehicles such as 
"stealth" and other antibody conjugated liposomes (including lipid mediated drug targeting to 

1 5 colonic carcinoma), receptor mediated targeting of DNA through cell specific ligands, 

lymphocyte directed tumor targeting, and highly specific therapeutic retroviral targeting of 
murine glioma cells in vivo. The following references are examples of the use of this 
technology to target specific proteins to tumor tissue (Hughes et al., Cancer Research. 
49:6214-6220, (1989); and Litzinger and Huang, Biochimica et Biophvsica Acta. 1 104:179-187, 

20 (1 992)). In general, receptors are involved in pathways of endocytosis, either constitutive or 
ligand induced. These receptors cluster in clathrin-coated pits, enter the cell via clathrin-coated 
vesicles, pass through an acidified endosome in which the receptors are sorted, and then either 
recycle to the cell surface, become stored intracellularly, or are degraded in lysosomes. The 
internalization pathways serve a variety of functions, such as nutrient uptake, removal of 

25 activated proteins, clearance of macromolecules, opportunistic entry of viruses and toxins, 
dissociation and degradation of ligand, and receptor-level regulation. Many receptors follow 
more than one intracellular pathway, depending on the cell type, receptor concentration, type of 
ligand, ligand valency, and ligand concentration. Molecular and cellular mechanisms of 
receptor-mediated endocytosis has been reviewed (Brown and Greene, DNA and Cell Biology 

30 10:6,399-409(1991)). 

[120] Nucleic acids that are delivered to cells which are to be integrated into the host 
cell genome, typically contain integration sequences. These sequences are often viral related 
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sequences, particularly when viral based systems are used. These viral integration systems can 
also be incorporated into nucleic acids which are to be delivered using a non-nucleic acid based 
system of deliver, such as a liposome, so that the nucleic acid contained in the delivery system 
can be come integrated into the host genome. 

5 [121] Other general techniques for integration into the host genome include, for 

example, systems designed to promote homologous recombination with the host genome. These 
systems typically rely on sequence flanking the nucleic acid to be expressed that has enough 
homology with a target sequence within the host cell genome that recombination between the 
vector nucleic acid and the target nucleic acid takes place, causing the delivered nucleic acid to 
1 0 be integrated into the host genome. These systems and the methods necessary to promote 
homologous recombination are known to those of skill in the art 

c) In vivo/ex vivo * 

[122] As described herein, the compositions can be administered in a pharmaceutically 
acceptable carrier and can be delivered to the subject's cells in vivo and/or ex vivo by a variety 
15 of mechanisms well known in the art (e.g., uptake of naked DNA, liposome fusion, 
intramuscular injection of DNA via a gene gun, endocytosis and the like). 

[123] If ex vivo methods are employed, cells or tissues can be removed and maintained 
outside the body according to standard protocols well known in the art. The compositions can 
be introduced into the cells via any gene transfer mechanism, such as, for example, calcium 
20 phosphate mediated gene delivery, electroporation, microinjection or proteoliposomes. The 
transduced cells can then be infused (e.g., in a pharmaceutically acceptable carrier) or 
homotopically transplanted back into the subject per standard methods for the cell or tissue type. 
Standard methods are known for transplantation or infusion of various cells into a subject. 

[ 1 24] If in vivo delivery methods are performed the methods can be designed to deliver 
25 the nucleic acid constructs directly to a particular cell type, via any delivery mechanism, such as 
intra-peritorieal injection of a vector construct. In this type of delivery situation, the nucleic acid 
constructs can be delivered to any type of tissue, for example, brain or neural or muscle. The 
nucleic acid constructs can also be delivered such that they generally deliver the nucleic acid 
constructs to more than one type of cell. This type of delivery can be accomplished, by for 
30 example, injecting the constructs intraperitoneally into the flank of the organism. (See Example 
2 and figures 8-10). In certain delivery methods, the timing of the deli very is monitored. For 
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example, the nucleic acid constructs can be delivered at the perinatal stage of the recipient's life 
or at the adult stage. 

[125] The various vectors delivering the opioid receptors, such as the m-opioid 
receptor can be delivered to differentiated cells. For example, cells that are quiescent can be 
5 targeted with the disclosed vectors in certain embodiments. For example, nerve cells, which are 
no longer dividing, or are dividing very slowly, can be transfected with the disclosed 
compositions in certain embodiments. The nucleic acids can be delivered peripherally in certain 
embodiments and can be delivered by injection at a site distal to the body of the cell. For 
example, pain may be initiated at a point in the foot of an organism, but the body of the nerve 

10 transmitting the pain signal will be located at or near the spinal cord. In certain embodiments, 
the compositions can be delivered at the foot, transfecting the distal part of the nerve, including 
the most distal part of the nerve. Transfection, can take place along the full length of the cell, 
however. In certain embodiments, the vectors are delivered by injection at a site distal to a 
nerve body, or, for example, at the point of the pain with regard to where the body of the nerve 

15 is located. 

4. Expression systems 

[126] The nucleic acids that are delivered to cells typically contain expression 
controlling systems. For example, the inserted genes in viral and retroviral systems usually 
contain promoters, and/or enhancers to help control the expression of the desired gene product 
20 A promoter is generally a sequence or sequences of DNA that function when in a relatively 
fixed location in regard to the transcription start site. A promoter contains core elements 
required for basic interaction of RNA polymerase and transcription factors, and can contain 
upstream elements and response elements. 

a) Promoters and Enhancers 

25 [127] Preferred promoters controlling transcription from vectors in mammalian host 

cells can be obtained from various sources, for example, the genomes of viruses such as: 
polyoma, Simian Virus 40 (SV40), adenovirus, retroviruses, hepatitis-B virus and most 
preferably cytomegalovirus, or from heterologous mammalian promoters, e.g. beta actin 
promoter. The early and late promoters of the S V40 virus are conveniently obtained as an SV40 

30 restriction fragment which also contains the SV40 viral origin of replication (Fiers et al., Nature. 
273 : 113(1 978)). The immediate early promoter of the human cytomegalovirus is conveniently 
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obtained as a Hindm E restriction fragment (Greenway, P.J. et al., Gene 18: 355-360 (1982)). 
Of course, promoters from the host cell or related species also are useful herein. 

[128] Enhancer generally refers to a sequence of DNA that functions at no fixed 
distance from the transcription start site and can be either 5' (Laimins, L. et al., Proc. Natl. 
5 Acad. Sci. 7 8: 993 (1981)) or 3 f (Lusky, M.L., et al., Mol. Cell Bio. 3: 1 108 (1983)) to the 
transcription unit. Furthermore, enhancers can be within an intron (Banerji, J.L. et al., Cell 33: 
729 (1983)) as well as within the coding sequence itself (Osborne, T.F., et al., Mol. Cell Bio. 
4: 1293 (1984)). They are usually between 10 and 300 bp in length, and they function in cis. 
Enhancers f unction to increase transcription from nearby promoters. Enhancers also often 

10 contain response elements that mediate the regulation of transcription. Promoters can also 
contain response elements that mediate the regulation of transcription. Enhancers often 
determine the regulation of expression of a gene. While many enhancer sequences are now 
known from mammalian genes (globin, elastase, albumin, -fetoprotein and insulin), typically 
one will use an enhancer from a eukaryotic cell virus for general expression. Preferred 

15 examples are the SV40 enhancer on the late side of the replication origin (bp 1 00-270), the 
cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the 
replication origin, and adenovirus enhancers. 

[129] The promoter and/or enhancer can be specifically activated either by light or 
specific chemical events which trigger their function. Systems can be regulated by reagents 
20 such as tetracycline and dexamethasone. There are also ways to enhance viral vector gene 
expression by exposure to irradiation, such as gamma irradiation, or alkylating chemotherapy 
drugs. 

[130] In certain embodiments the promoter and/or enhancer region can act as a 
constitutive promoter and/or enhancer to maximize expression of the region of the transcription 
25 unit to be transcribed. In certain constructs the promoter and/or enhancer region be active in all 
eukaryotic cell types, even if it is only expressed in a particular type of cell at a particular time. 
A preferred promoter of this type is the CMV promoter (650 bases). Other preferred promoters 
are SV40 promoters, cytomegalovirus (full length promoter), and retroviral vector LTF. 

[131] It has been shown that all specific regulatory elements can be cloned and used to 
30 construct expression vectors that are selectively expressed in specific cell types such as 
melanoma cells. The glial fibrillary acetic protein (GFAP) promoter has been used to 
selectively express genes in cells of glial origin. 
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[ 1 32] Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, 
animal, human or nucleated cells) can also contain sequences necessary for the termination of 
transcription which can affect mRNA expression. These regions are transcribed as 
polyadenylated segments in the untranslated portion of the mRNA encoding tissue factor 
5 protein. The 3' untranslated regions also include transcription termination sites. It is preferred 
that the transcription unit also contains a polyadenylation region. One benefit of this region is 
that it increases the likelihood that the transcribed unit will be processed and transported like 
mRNA. The identification and use of polyadenylation signals in expression constructs is well 
established. It is preferred that homologous polyadenylation signals be used in the transgene 
10 constructs. In certain transcription units, the polyadenylation region is derived from the SV40 
early polyadenylation signal and consists of about 400 bases. It is also preferred that the 
transcribed units contain other standard sequences alone or in combination with the above 
sequences improve expression from, or stability of, the construct 

b) Constitutive promoters 

15 [ 1 33] In certain embodiments the promoters are constitutive promoters. This can be 

any promoter that causes transcription regulation in the absence of the addition of other factors. 
Examples of this type of promoter are the CMV promoter and the beta actin promoter, as well as 
others dicussed herein. In certain embodiments the promoter can consist of fusions of one or 
more different types of promoters. For example, the regulatory regions of the CMV promoter 

20 and the beta actin promoter are well known and understood, examples, of which aire disclosed 
herein. Parts of these promoters can be fused together to, for example, produce a CMV-beta 
actin fusion promoter. It is understood that this type of promoter has a CMV component and a 
beta actin component These components can function independently as promoters, and thus, 
are themselves considered beta actin promoters and CMV promoters. A promoter can be any 

25 portion of a known promoter that causes promoter activity. It is well understood that many 
promoters, including the CMV and Beta Actin promoters have functional domains which are 
understood and that these can be used as a beta actin promoter or CMV promoter. Furthermore, 
these domains can be determined. There are many CMV promoter variations that exist, as well 
as beta actin promoters, and fusion promoters. These promoters can be compared, and for 

30 example, functional regions delineated, as described herein. Furthermore, each of these 
sequences can function independently or together in any combination to provide a promoter 
region for the disclosed nucleic acids. 
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c) Non-constitutive promoters 

[134] The promoters can also be non-constitutive promoters, such as cell specific 
promoters. These are promoters that are turned on at specific time in development or stage or a 
particular type of cell, such as a cardiac cell, or neural cell, or a bone cell. Some examples of 
5 cell specific promoters are, the neural enolase specifc promoter, (NSE) the COLLI Al 
procollagen promoter, and the CD1 lb promoter (PBMC-microglia/macrophage/monocyte 
specific promoter. 

[02] It is understood that the recombinant systems can be expressed in a tissue- 
specific manner. It is understood that tissue specific expression can occur due to the presence of 

10 a tissue-specific promoter. Typically, proteins under control of a tissue-specific promoter are 
transcribed when the promoter becomes active by virtue of being present in the tissue for which 
it is specific. Therefore, all cells can encode for a particular gene without global expression. As 
such, labeled proteins can be shown to be present in certain tissues without expression in other 
nearby tissues that may complicate results or expression of proteins in tissues where expression 

15 may be detrimental to the host Disclosed are methods wherein the ere recombinase is under the 
control of the EIIA promoter, a promoter specific for breast tissue, such as the WAP promoter, a 
promoter specific for ovarian tissue, such as the ACTB promoter, or a promoter specific for 
bone tissue, such as osteocalcin. Any tissues specific promoter can be used. Promoters specific 
for prostate, testis, and neural are also disclosed. Examples of some tissue-specific promoters 

20 include but are not limited to MUC1, EIIA, ACTB, WAP, bHLH-EC2, HOXA-1, Alpha- 

fetoprotein (AFP), opsin, CR1/2, Fc-y-Recejrtor 1 (Fc-y-Rl), MMTVD-LTR, the human insulin 
promoter, Pdha-2, rat neuron-specific enolase. For example, use of the AFP promoter creates 
specificity for the liver. Another example, HOXA-1 is a neuronal tissue specific promoter, and 
as such, proteins expressed under the control of HOXA-1 are only expressed in neuronal tissue. 

25 (All of which are herein incorporated by reference at least for the sequence of the promoters 
and related sequences.) 

[135] Other cell specific promoters can be found in (Sutcliffe, J.G. (1988), Ann. Rev. 
Neuroscience 1 1, 157-198). For example, when transfecting nerve cells, there are a variety of 
nerve specific promoters, such as the neuron specific enolase promoter. Other examples of 
30 neuron specific promoters would be the Tau promoter, Synapsin I (Hoesche, C, Sauerwald, A., 
et al., (1993) J. Biol Chem. 268, 26494-26502.and H (Chin, L.-S et al., (1994), J. Biol Chem. 
269, 18507-18513) promoters, the amino acid decarboxylase (AADQ (Albert, V., et aL, (1992), 
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Proc. Natl Acad. Sci. 89, 12053-12057) and FE65 (Faraonio, R., et al., (1994), Nucl Acids Res. 
22, 4876-4883) promoters. Other nerve specific promoters include, the promoter for the WT1 
gene (Fraizer, G, et al., (1994), 7. Biol. Chem. 269, 8892-8900), microfilament light chain 
promoter (Yazdanbakhsh, K., et al, (1993) Nucl Acids Res. 21, 455-461), and the glial 
5 fibrillary acidic protein, (Kaneko, R. & Sueoka, N. (1 993) Proc. Natl Acad. ScL 90, 4698- 
4702). (All of which are herein incorporated by reference at least for the sequence of the 
promoters and related sequences.) 

[136] Expression of the transgene can be targeted selectively to neurons by cloning a 
neuron specific promoter, such as the NSE promoter as disclosed herein (Liu H. et al., Journal 

10 ofNeuroscience. 23(18):7143-54, 2003); tyrosine hydroxylase promoter (Kessler MA. et al., 
Brain Research. Molecular Brain Research. 1 12(1-2) :8-23, 2003); myelin basic protein 
promoter (Kessler MA. et al Biochemical & Biophysical Research Communications. 
288(4) :809- 18, 2001); glial fibrillary acidic protein promoter (Nolte C. et al., GLIA. 33(1): 72- 
86, 2001); neurofilaments gene (heavy, medium, light) promoters (Yaworsky PJ.et al., Journal 

15 of Biological Chemistry. 272(40) :25 112-20, 1997 ) (All of which are herein incorporated by 
reference at least for the sequence of the promoters and related sequences.) The NSE promoter 
is disclosed in Peel AL. et al., Gene Therapy. 4(1): 16-24, 1 997 >(SEQ ID NO:69) (pTR- 
NT3myc; Powell Gene Therapy Center, University of Florida, Gainesville FL). 

d) Markers 

20 [137] The viral vectors can include nucleic acid sequence encoding a marker product 

This marker product is used to determine if the gene has been delivered to the cell and once 
delivered is being expressed. Preferred marker genes are the E. Coli lacZ gene, which encodes 
6-galactosidase, and green fluorescent protein. 

[138] In some embodiments the marker can be a selectable marker. Examples of 
25 suitable selectable markers for mammalian cells are dihydrofolate reductase (DHFR), thymidine 
kinase, neomycin, neomycin analog G418, hydromycin, and puromycin. When such selectable 
markers are successfully transferred into a mammalian host cell, the transformed mammalian 
host cell can survive if placed under selective pressure. There are two widely used distinct 
categories of selective regimes. The first category is based on a cell's metabolism and the use of 
30 a mutant cell line which lacks the ability to grow independent of a supplemented media. Two 
examples are: CHO DHFR- cells and mouse LTK- cells. These cells lack the ability to grow 
without the addition of such nutrients as thymidine or hypoxanthine. Because these cells lack 
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certain genes necessary for a complete nucleotide synthesis pathway, they cannot survive unless 
the missing nucleotides are provided in a supplemented media. An alternative to supplementing 
the media is to introduce an intact DHFR or TK gene into cells lacking the respective genes, 
thus altering their growth requirements. Individual cells which were not transformed with the 
5 DHFR or TK gene will not be capable of survival in non-supplemented media. 

[139] The second category is dominant selection which refers to a selection scheme 
used in any cell type and does not require the use of a mutant cell line. These schemes typically 
use a drug to arrest growth of a host cell. Those cells which have a novel gene would express a 
protein conveying drug resistance and would survive the selection. Examples of such dominant 

10 selection use the drugs neomycin, (Southern P. and Berg, P., J. Molec. AppL Genet. 1: 327 
(1982)), mycophenolic acid, (Mulligan, R.C. and Berg, P. Science 209: 1422 (1980)) or 
hygromycin, (Sugden, B. et al., Mol Cell BioL 5: 410-413 (1985)). The three examples 
employ bacterial genes under eukaryotic control to convey resistance to the appropriate drug 
G4 1 8 or neomycin (geneticin), xgpt (mycophenolic acid) or hygromycin, respectively. Others 

1 5 include the neomycin analog G4 1 8 and puramycin. 

e) Post transcriptional regulatory elements 

[140] The disclosed vectors can also contain post-transcriptional regulatory elements. 
Post-transcriptional regulatory elements can enhance mRNA stability or enhance translation of 
the transcribed mRNA. An exemplary post-transcriptional regulatory sequence is the WPRE 
20 sequence isolated from the woodchuck hepatitis virus. (Zufferey R, et al., "Woodchuck 
hepatitis virus post-transcriptional regulatory element enhances expression of transgenes 
delivered by retroviral vectors," J Virol; 73:2886-92 (1999)). Post-transcriptional regulatory 
elements can be positioned both 3* and 5' to the exogenous gene, but it is preferred that they are 
positioned 3' to the exogenous gene. 

25 f) Transduction efficiency elements 

[141] Transduction efficiency elements are sequences that enhance the packaging and 
transduction of the vector. These elements typically contain polypurine sequences. An example 
of a transduction efficiency element is the ppt-cts sequence that contains the central polypurine 
tract (ppt) and central terminal site (cts) from the FTV. These sequences are in the disclosed FIV 
30 sequences herein. Each retrovirus and lentivirus can have there own ppt-cts. 

g) 3* untranslated regions 
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[142] Expression vectors used in eukaiyotic host cells (yeast, fungi, insect, plant, 
animal, human or nucleated cells) can also contain sequences necessary for the termination of 
transcription which can affect mRNA expression. These 3 1 untranslated regions are transcribed 
as polyadenylated segments in the untranslated portion of the mRNA encoding the exogenous 
5 gene. The 3 ! untranslated regions also include transcription termination sites. The transcription 
unit also can contain a polyadenylation region. One benefit of this region is that it increases the 
likelihood that the transcribed unit will be processed and transported like mRNA. The 
identification and use of polyadenylation signals in expression constructs is well established. 
Homologous polyadenylation signals can be used in the transgene constructs. In an embodiment 
10 of the transcription unit, the polyadenylation region is derived from the SV40 early 

polyadenylation signal and consists of about 400 bases. Transcribed units can contain other 
standard sequences alone or in combination with the above sequences improve expression from, 
or stability of, the construct. 

5. Sequence similarities 

15 [ 143] It is understood that as discussed herein the use of the terms homology and 

identity mean the same thing as similarity. Thus, for example, if the use of the word homology 
is used between two non-natural sequences it is understood that this is not necessarily indicating 
an evolutionary relationship between these two sequences, but rather is looking at the similarity 
or relatedness between their nucleic acid sequences. Many of the methods for determining 

20 homology between two evolutionarily related molecules are routinely applied to any two or 
more nucleic acids or proteins for the purpose of measuring sequence similarity regardless of 
whether they are evolutionarily related or not. 

[144] In general, it is understood that one way to define any known variants and 
derivatives or those that might arise, of the disclosed genes and proteins herein, is through 

25 defining the variants and derivatives in terms of homology to specific known sequences. This 
identity of particular sequences disclosed herein is also discussed elsewhere herein. In general, 
variants of genes and proteins herein disclosed typically have at least, about 70, 71, 72, 73, 74, 
75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 
percent homology to the stated sequence or the native sequence. Those of skill in the art readily 

30 understand how to determine the homology of two proteins or nucleic acids, such as genes. For 
example, the homology can be calculated after aligning the two sequences so that the homology 
is at its highest level. 
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[145] Another way of calculating homology can be performed by published algorithms. 
Optimal alignment of sequences for comparison can be conducted by the local homology 
algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology alignment 
algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1 970), by the search for similarity 
5 method of Pearson and Lipman, Proc. Nail. Acad. Sci. U.S.A. 85: 2444 (1988), by 

computerized implementations of these algorithms (GAP, BESTFTT, FASTA, and TFASTA in 
the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., 
Madison, WI), or by inspection. 

[146] The same types of homology can be obtained for nucleic acids by for example 
10 the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger etal. Proc. Natl. Acad. 
Sci. USA 86:7706-7710, 1989, Jaeger etal. Methods Enzymol. 183:281-306, 1989 which are 
herein incorporated by reference for at least material related to nucleic acid alignment It is 
understood that any of the methods typically can be used and that in certain instances the results 
of these various methods can differ, but the skilled artisan understands if identity is found with 
1 5 at least one of these methods, the sequences would be said to have the stated identity, and be 
disclosed herein. 

[147] For example, as used herein, a sequence recited as having a particular percent 
homology to another sequence refers to sequences that have the recited homology as calculated 
by any one or more of the calculation methods described above. For example, a first sequence 

20 has 80 percent homology, as defined herein, to a second sequence if the first sequence is 
calculated to have 80 percent homology to the second sequence using the Zuker calculation 
method even if the first sequence does not have 80 percent homology to the second sequence as 
calculated by any of the other calculation methods. As another example, a first sequence has 80 
percent homology, as defined herein, to a second sequence if the first sequence is calculated to 

25 have 80 percent homology to the second sequence using both the Zuker calculation method and 
the Pearson and Lipman calculation method even if the first sequence does not have 80 percent 
homology to the second sequence as calculated by the Smith and Waterman calculation method, 
the Needleman and Wunsch calculation method, the Jaeger calculation methods, or any of the 
other calculation methods. As yet another example, a first sequence has 80 percent homology, 

30 as defined herein, to a second sequence if the first sequence is calculated to have 80 percent 
homology to the second sequence using each of calculation methods (although, in practice, the 
different calculation methods will often result in different calculated homology percentages). 
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6. Hybridization/selective hybridization 

[148] The term hybridization typically means a sequence driven interaction between at 
least two nucleic acid molecules, such as a primer or a probe and a gene. Sequence driven 
interaction means an interaction that occurs between two nucleotides or nucleotide analogs or 
5 nucleotide derivatives in a nucleotide specific manner. For example, G interacting with C or A 
interacting with T are sequence driven interactions. Typically sequence driven interactions 
occur on the Watson-Crick face or Hoogsteen face of the nucleotide. The hybridization of two 
nucleic acids is affected by a number of conditions and parameters known to those of skill in the 
art. For example, the salt concentrations, pH, and temperature of the reaction all affect whether 
10 two nucleic acid molecules will hybridize. 

[149] Parameters for selective hybridization between two nucleic acid molecules are 
well known to those of skill in the art. For example, in some embodiments selective 
hybridization conditions can be defined as stringent hybridization conditions. For example, 
stringency of hybridization is controlled by both temperature and salt concentration of either or 

1 5 both of the hybridization and washing steps. For example, the conditions of hybridization to 
achieve selective hybridization can involve hybridization in high ionic strength solution (6X 
SSC or 6X SSPE) at a temperature that is about 12-25°C below the Tm (the melting temperature 
at which half of the molecules dissociate from their hybridization partners) followed by washing 
at a combination of temperature and salt concentration chosen so that the washing temperature 

20 is about 5°C to 20°C below the Tm. The temperature and salt conditions are readily 
determined empirically in preliminary experiments in which samples of reference DNA 
immobilized on filters are hybridized to a labeled nucleic acid of interest and then washed under 
conditions of different stringencies. Hybridization temperatures are typically higher for DNA- 
RNA and RNA-RNA hybridizations. The conditions can be used as described above to achieve 

25 stringency, or as is known in the art (Sambrook et al., Molecular Cloning: A Laboratory 
Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989; 
Kunkeletal. Methods Enzymol. 1987:154:367, 1987 which is herein incorporated by reference 
for material at least related to hybridization of nucleic acids). A preferable stringent 
hybridization condition for a DNA:DNA hybridization can be at about 68°C (in aqueous 

30 solution) in 6X SSC or 6X SSPE followed by washing at 68°C. Stringency of hybridization and 
washing, if desired, can be reduced accordingly as the degree of complementarity desired is 
decreased, and further, depending upon the G-C or A-T richness of any area wherein variability 
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is searched for. Likewise, stringency of hybridization and washing, if desired, can be increased 
accordingly as homology desired is increased, and further, depending upon the G-C or A-T 
richness of any area wherein high homology is desired, all as known in the art. 

[1 50] Another way to define selective hybridization is by looking at the amount 
5 (percentage) of one of the nucleic acids bound to the other nucleic acid. For example, in some 
embodiments selective hybridization conditions would be when at least about, 60, 65, 70, 71, 
72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 
97, 98, 99, 100 percent of the limiting nucleic acid is bound to the non-limiting nucleic acid. 
Typically, the non-limiting primer is in for example, 10 or 100 or 1000 fold excess. This type of 
10 assay can be performed at under conditions where both the limiting and non-limiting primer are 
for example, 10 fold or 100 fold or 1000 fold below their kd, or where only one of the nucleic 
acid molecules is 1 0 fold or 1 00 fold or 1 000 fold or where one or both nucleic acid molecules 
are above their kd- 

[151] Another way to define selective hybridization is by looking at the percentage of 
15 primer that gets enzymatically manipulated under conditions where hybridization is required to 
promote the desired enzymatic manipulation. For example, in some embodiments selective 
hybridization conditions would be when at least about, 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 
79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of 
the primer is enzymatically manipulated under conditions which promote the enzymatic 
20 manipulation, for example if the enzymatic manipulation is DNA extension, then selective 

hybridization conditions would be when at least about 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 
79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of 
the primer molecules are extended. Preferred conditions also include those suggested by the 
manufacturer or indicated in the art as being appropriate for the enzyme performing the 
25 manipulation. 

[152] Just as with homology, it is understood that there are a variety of methods herein 
disclosed for determining the level of hybridization between two nucleic acid molecules. It is 
understood that these methods and conditions can provide different percentages of hybridization 
between two nucleic acid molecules, but unless otherwise indicated meeting the parameters of 
30 any of the methods would be sufficient. For example if 80% hybridization was required and as 
long as hybridization occurs within the required parameters in any one of these methods it is 
considered disclosed herein. 
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[1 53] It is understood that those of skill in the art understand that if a composition or 
method meets any one of these criteria for determining hybridization either collectively or singly 
it is a composition or method that is disclosed herein. 

7. Nucleic acids 

5 [1 54] There are a variety of molecules disclosed herein that are nucleic acid based, 

including for example the nucleic acids that encode, for example )n-opioid receptor, or 
functional nucleic acids. The disclosed nucleic acids can be made up of for example, 
nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting examples of these and 
other molecules are discussed herein. It is understood that for example, when a vector is 
10 expressed in a cell, that the expressed mRNA will typically be made up of A, C, G, and U. 

Likewise, it is understood that if, for example, an antisense molecule is introduced into a cell or 
cell environment through for example exogenous delivery, it is advantagous that the antisense 
molecule be made up of nucleotide analogs that reduce the degradation of the antisense 
molecule in the cellular environment 

15 [155] A nucleotide is a molecule that contains a base moiety, a sugar moiety and a 

phosphate moiety. Nucleotides can be linked together through their phosphate moieties and 
sugar moieties creating an internucleoside linkage. The base moiety of a nucleotide can be 
adenin-9-yl (A), cytosin-i-yl (C), guanin-9-yl (G), uracil-l-yl (U), and thymin-l-yl (T). The 
sugar moiety of a nucleotide is a ribose or a deoxyribose. The phosphate moiety of a nucleotide 

20 is pentavalent phosphate. A non-limiting example of a nucleotide would be 3'-AMP (3- 
adenosine monophosphate) or 5-GMP (5-guanosine monophosphate). 

[156] A nucleotide analog is a nucleotide which contains some type of modification to 
either the base, sugar, or phosphate moieties: Modifications to nucleotides are well known in 
the art and would include for example, 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, 
25 xanthine, hypoxanthine, and 2-aminoadenine as well as modifications at the sugar or phosphate 
moieties. 

[ 1 57] Nucleotide substitutes are molecules having similar functional properties to 
nucleotides, but which do not contain a phosphate moiety, such as peptide nucleic acid (PNA). 
Nucleotide substitutes are molecules that will recognize nucleic acids in a Watson-Crick or 
30 Hoogsteen manner, but which are linked together through a moiety other than a phosphate 
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moiety. Nucleotide substitutes are able to conform to a double helix type structure when 
interacting with the appropriate target nucleic acid. 

[158] It is also possible to link other types of molecules (conjugates) to nucleotides or 
nucleotide analogs to enhance for example, cellular uptake. Conjugates can be chemically 
linked to the nucleotide or nucleotide analogs. Such conjugates include but are not limited to 
lipid moieties such as a cholesterol moiety. (Letsinger et aL, Proc. Natl Acad. Set USA, 
1989,86, 6553-6556), 

[159] A Watson-Crick interaction is at least one interaction with the Watson-Crick face 
of a nucleotide, nucleotide analog, or nucleotide substitute. The Watson-Crick face of a 
nucleotide, nucleotide analog, or nucleotide substitute includes the C2, Nl, and C6 positions of 
a purine based nucleotide, nucleotide analog, or nucleotide substitute and the C2, N3, C4 
positions of a pyrimidine based nucleotide, nucleotide analog, or nucleotide substitute. 

[160] A Hoogsteen interaction is the interaction that takes place on the Hoogsteen face 
of a nucleotide or nucleotide analog, which is exposed in the major groove of duplex DNA. The 
Hoogsteen face includes the N7 position and reactive groups (NH2 or O) at the C6 position of 
purine nucleotides. 

a) Sequences 

[161] There are a variety of sequences related to ji-opioid receptor and promoter 
sequences. These sequences and others are herein incorporated by reference in their entireties as 
well as for individual subsequences contained therein. It is understood that there are numerous 
Genbank accession sequences related to ^-opioid receptor, all of which are incorporated by 
reference herein. 

[162] One particular sequence set forth in SEQ ID NO:2 , which is a sequence for 
human fx-opioid receptor cDNA, is used herein, as an example, to exemplify the disclosed 
compositions and methods. It is understood that the description related to this sequence is 
applicable to any sequence related to ^-opioid receptor unless specifically indicated otherwise. 
Those of skill in the art understand how to resolve sequence discrepancies and differences and 
to adjust the compositions and methods relating to a particular sequence to other related 
sequences. Primers and/or probes can be designed for any of the sequences disclosed herein 
given the information disclosed herein and that known in the art. 
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[163] It is also understood for example that there are numerous vectors that can be used 
to create the p,-opioid receptor construct nucleic acids. 

b) Primers and probes 

[ 1 64] Disclosed are compositions including primers and probes, which are capable of 
5 interacting with, for example, the ji-opioid receptor construct nucleic acids, as disclosed herein. 
In certain embodiments the primers are used to support DNA amplification reactions. 
Typically the primers will be capable of being extended in a sequence specific maimer. 
Extension of a primer in a sequence specific manner includes any methods wherein the 
sequence and/or composition of the nucleic acid molecule to which the primer is hybridized or 

10 otherwise associated directs or influences the composition or sequence of the product produced 
by the extension of the primer. Extension of the primer in a sequence specific manner therefore 
includes, but is not limited to, PCR, DNA sequencing, DNA extension, DNA polymerization, 
RNA transcription, or reverse transcription. Techniques and conditions that amplify the primer 
in a sequence specific manner are preferred. In certain embodiments the primers are used for 

15 the DNA amplification reactions, such as PCR or direct sequencing. It is understood that in 
certain embodiments the primers can also be extended using non-enzymatic techniques, where 
for example, the nucleotides or oligonucleotides used to extend the primer are modified such 
that they will chemically react to extend the primer in a sequence specific manner. Typically the 
disclosed primers hybridize with, for example, the ji-opioid receptor construct nucleic acid, or 

20 region of the ^-opioid receptor construct nucleic acids or they hybridize with the complement of 
the ^-opioid receptor construct nucleic acids or complement of a region of the ^-opioid receptor 
construct nucleic acids. 

8. Peptides 

a) Protein variants 

25 [1 65] As discussed herein there are numerous variants of the ^-opioid receptor protein 

that are known and herein contemplated. In addition, to the known functional species and 
allelic variants of ^-opioid receptor there are derivatives of the p.-opioid receptor protein which 
also function in the disclosed methods and compositions. Protein variants and derivatives are 
well understood to those of skill in the art and in can involve amino acid sequence 

30 modifications. For example, amino acid sequence modifications typically fall into one or more 
of three classes: substitutional, insertional or deletional variants. Insertions include amino 
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and/or carboxyl terminal fusions as well as intrasequence insertions of single or multiple amino 
acid residues. Insertions ordinarily will be smaller insertions than those of amino or carboxyl 
terminal fusions, for example, on the order of one to four residues. Immunogenic fusion protein 
derivatives, such as those described in the examples, are made by fusing a polypeptide 
5 sufficiently large to confer immunogenicity to the target sequence by cross-linking in vitro or by 
recombinant cell culture transformed with DNA encoding the fusion. Deletions are 
characterized by the removal of one or more amino acid residues from the protein sequence. 
Typically, no more than about from 2 to 6 residues are deleted at any one site within the protein 
molecule. These variants ordinarily are prepared by site specific mutagenesis of nucleotides in 

10 the DNA encoding the protein, thereby producing DNA encoding the variant, and thereafter 
expressing the DNA in recombinant cell culture. Techniques for making substitution mutations 
at predetermined sites in DNA having a known sequence are well known, for example M l 3 
primer mutagenesis and PCR mutagenesis. Amino acid substitutions are typically of single 
residues, but can occur at a number of different locations at once; insertions usually will be on 

15 the order of about from 1 to 10 amino acid residues; and deletions will range about from 1 to 30 . 
residues. Deletions or insertions preferably are made in adjacent pairs, i.e. a deletion of 2 
residues or insertion of 2 residues. Substitutions, deletions, insertions or any combination 
thereof can be combined to arrive at a final construct The mutations must not place the 
sequence out of reading frame and preferably will not create complementary regions that could 

20 produce secondary mRNA structure. Substitutional variants are those in which at least one 
residue has been removed and a different residue inserted in its place. Such substitutions 
generally are made in accordance with the following Tables 4 and 5 and are referred to as 
conservative substitutions. 

[166] TABLE 4:Amino Acid Abbreviations 



Amino Add 


Abbreviations 


alanine 


AlaA 


allosoleucine 


Aile 


arginine 


ArgR 


asparagine 


AsnN 


aspartic acid 


AspD 


cysteine 


CysC 


glutamic acid 


GluE 


glutamine 


GlnK 


glycine 


GlyG 


histidine 


ffisH 


isolelucine 


Ilel 


leucine 


LeuL 
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Amino Acid 


Abbreviations 


lysine 


LysK 


phenylalanine 


PheF 


nroline 


Prop 


pyroglutamic 
acidp 


Glu 


serine 


SerS 


threonine 


ThrT 


tyrosine 


TyrY 


tryptophan 


TrpW 


valine 


ValV 



TABLE 5:Amino Acid Substitutions 


Original Residue Exemplary Conservative Substitutions, others are known 


in the art 




Ala 


ser 


Arg 


lys, gin 


Asn 


gin; his 


Asp 


glu 


Cys 


ser 


Gin 


asn, lys 


Glu 


asp 


Gly 


pro 


His 


asn;gln 


lie 


leu; val 


Leu 


ile; val 


Lys 


arg; gin; 


Met 


Leu; ile 


Phe 


met; leu; tyr 


Ser 


thr 


Thr ser 


Trp 


tyr 


Tyr 


trp; phe 


Val 


ile; leu 



[ 1 67] Substantial changes in function or immunological identity are made by selecting 
substitutions that are less conservative than those in Table 2, i.e., selecting residues that differ 
more significantly in their effect on maintaining (a) the structure of the polypeptide backbone in 
the area of the substitution, for example as a sheet or helical conformation, (b) the charge or 
hydtophobicity of the molecule at the target site or (c) the bulk of the side chain. The 
substitutions which in general are expected to produce the greatest changes in the protein 
properties will be those in which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted 
for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a 
cysteine or proline is substituted for (or by) any other residue; (c) a residue having an 
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electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an 
electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, 
e.g., phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine, in this 
case, (e) by increasing the number of sites for sulfation and/or glycosylation. 

5 [168] For example, the replacement of one amino acid residue with another that is 

biologically and/or chemically similar is known to those skilled in the art as a conservative 
substitution. For example, a conservative substitution would be replacing one hydrophobic 
residue for another, or one polar residue for another. The substitutions include combinations 
such as, for example, Gly, Ala; Val, lie, Leu; Asp, Glu; Asn, Gin; Ser, Thr, Lys, Arg; and Phe, 
10 Tyr. Such conservatively substituted variations of each explicitly disclosed sequence are 
included within the mosaic polypeptides provided herein. 

[169] Substitutional or deletional mutagenesis can be employed to insert sites for N- 
glycosylation (Asn-X-Thi/Ser) or O-glycosylation (Ser or Thr). Deletions of cysteine or other 
labile residues also may be desirable. Deletions or substitutions of potential proteolysis sites, 
1 5 e.g. Arg, is accomplished for example by deleting one of the basic residues or substituting one 
by glutaminyl or histidyl residues. 

[1 70] Certain post-translational derivatizations are the result of the action of 
recombinant host cells on the expressed polypeptide. Glutaminyl and asparaginyl residues are 
frequently post-translationally deamidated to the corresponding glutamyl and asparyl residues. 

20 Alternatively, these residues are deamidated under mildly acidic conditions. Other post- 
translational modifications include hydroxylation of proline and lysine, phosphorylation of 
hydroxyl groups of seryl or threonyl residues, methylation of the o-amino groups of lysine, 
arginine, and histidine side chains (T.E. Creighton, Proteins: Structure and Molecular 
Properties, W. H. Freeman & Co., San Francisco pp 79-86 (1983)), acetylation of the N- 

25 terminal amine and, in some instances, amidation of the C-terminal carboxyl. 

[171] It is understood that one way to define the variants and derivatives of the 
disclosed proteins herein is through defining the variants and derivatives in terms of 
homology/identity to specific known sequences. For example, SEQ ID NO:l sets forth a 
particular sequence of ^-opioid receptor. Specifically disclosed are variants of these and other 
30 proteins herein disclosed which have at least, 70% or 75% or 80% or 85% or 90% or 95% 

homology to the stated sequence. Those of skill in the art readily understand how to determine 
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the homology of two proteins. For example, the homology can be calculated after aligning the 
two sequences so that the homology is at its highest level. 

[ 1 72] Another way of calculating homology can be performed by published algorithms. 
Optimal alignment of sequences for comparison can be conducted by the local homology 
5 algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology 

alignment algorithm of Needleman and Wunsch, J. MoLBioL 48: 443 (1970), by the search 
for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85:2444(1988), 
by computerized implementations of these algorithms (GAP, BESTFTT, FASTA, and TFASTA 
in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., 
1 0 Madison, WI), or by inspection. 

[1 73] The same types of homology can be obtained for nucleic acids by for example 
the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl Acad. 
Sci. USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol 183:281-306, 1989 which are 
. herein incorporated by reference for at least material related to nucleic acid alignment ■ 

15 [1 74] It is understood that the description of conservative mutations and homology can 

be combined together in any combination, such as embodiments that have at least 70% 
homology to a particular sequence wherein the variants are conservative mutations. 

[1 75] As this specification discusses various proteins and protein sequences it is 
understood that the nucleic acids that can encode those protein sequences are also disclosed. 

20 This would include all degenerate sequences related to a specific protein sequence, i.e. all 
nucleic acids having a sequence that encodes one particular protein sequence as well as all 
nucleic acids, including degenerate nucleic acids, encoding the disclosed variants and 
derivatives of the protein sequences. Thus, while each particular nucleic acid sequence may not 
be written out herein, it is understood that each and every sequence is in fact disclosed and 

25 described herein through the disclosed protein sequence. For example, one of the many nucleic 
acid sequences that can encode the protein sequence set forth in SEQ ID NO:3 is set forth in 
SEQ ID NO:4 Another nucleic acid sequence that encodes the same protein sequence set forth 
in SEQ ID NO:3 is set forth in SEQ ID NO:8. In addition, for example, a disclosed 
conservative derivative of SEQ ID NO:3 is shown in SEQ ID NO: 9, where the valine (V) at 

30 position 21 is changed to an isoleucine (I). It is understood that for this mutation, all of the 
nucleic acid sequences that encode this particular derivative of the SEQ ID NO:3 polypeptide 
are also disclosed. It is also understood that while no amino acid sequence indicates what 
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particular DNA sequence encodes that protein within an organism, where particular variants of a 
disclosed protein are disclosed herein, the known nucleic acid sequence that encodes that 
protein in the particular organism from which that protein arises is also known and herein 
disclosed and described. 

5 [176] It is understood that there are numerous amino acid and peptide analogs which 

can be incorporated into the disclosed compositions. For example, there are numerous D amino 
acids or amino acids which have a different functional substituent then the amino acids shown 
in Table 1 and Table 2. The opposite stereo isomers of naturally occurring peptides are 
disclosed, as well as the stereo isomers of peptide analogs. These amino acids can readily be 

1 0 incorporated into polypeptide chains by charging tRNA molecules with the amino acid of choice 
and engineering genetic constructs that utilize, for example, amber codons, to insert the analog 
amino acid into a peptide chain in a site specific way (Thorson et al., Methods in Molec. Biol. 
77:43-73 (1991), Zoller, Current Opinion in Biotechnology, 3:348-354 (1992); Ibba, 
Biotechnology & Genetic Enginerring Reviews 13: 197-216 (1995), Cahill et al., TIBS, 

15 14(10):400-403 (1989); Benner, TIB Tech, 12:158-163 (1994); Ibba and Hennecke, 

Bio/technology, 12:678-682 (1994) all of which are herein incorporated by reference at least for 
material related to amino acid analogs). 

[177] Molecules can be produced that resemble peptides, but which are not connected 
via a natural peptide linkage. For example, linkages for amino acids or amino acid analogs can 

20 include CH 2 NH~, -CH 2 S~, ~CH 2 -CH 2 -, -CH=CH- (cis and trans), ~COCH 2 -, - 

CH(OH)CH 2 ~, and -CHH 2 SO — (These and others can be found in Spatola, A. F. in Chemistry 
and Biochemistry of Amino Acids, Peptides, and Proteins, B. Weinstein, eds., Marcel Dekker, 
New York, p. 267 (1983); Spatola, A. F., Vega Data (March 1983), Vol. 1, Issue 3, Peptide 
Backbone Modifications (general review); Morley, Trends Pharm Sci (1980) pp. 463-468; 

25 Hudson, D. et al., Int J Pept Prot Res 14:177-185 (1979) (-CH 2 NH— , CH 2 CH 2 -); Spatola et al. 
Life Sci 38:1243-1249 (1986) (— CH H 2 -S); Harm J. Chem. Soc Perkin Trans. 1 307-314 (1982) 
(-CH-CH-, cis and trans); Almquist et al. J. Med. Chem. 23:1392-1398 (1980) (-COCH 2 -); 
Jennings-White et al. Tetrahedron Lett 23:2533 (1982) (~COCH 2 -); Szelke et al. European 
Appln, EP 45665 CA (1982): 97:39405 (1982) (-CH(OH)CH 2 -); Holladay et al. Tetrahedron. 

30 Lett 24:4401-4404 (1983) (-C(OH)CH 2 ~); and Hruby Life Sci 31:189-199 (1982) (-CH 2 -S-); 
each of which is incorporated herein by reference. A particularly preferred non-peptide linkage 
is -CH 2 NH-. It is understood that peptide analogs can have more than one atom between the 
bond atoms, such as b-alanine, g-aminobutyric acid, and the like. 
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[178] Amino acid analogs and analogs and peptide analogs often have enhanced or 
desirable properties, such as, more economical production, greater chemical stability, enhanced 
pharmacological properties (half-life, absorption, potency, efficacy, etc.), altered specificity 
(e.g., a broad-spectrum of biological activities), reduced antigenicity, and others. 
5 [179] D-amino acids can be used to generate more stable peptides, because D amino 

acids are not recognized by peptidases and such. Systematic substitution of one or more amino 
acids of a consensus sequence with a D-amino acid of the same type (e.g., D-lysine in place of 
L-lysine) can be used to generate more stable peptides. Cysteine residues can be used to cyclize 
or attach two or more peptides together. This can be beneficial to constrain peptides into 
10 particular conformations. (Rizo and Gierasch Ann. Rev. Biochem. 61:387 (1992), incorporated 
herein by reference). 

[180] 

9. Pharmaceutical carriers/Delivery of pharmaceutical products 

[181] As described above, the compositions can also be administered in vivo in a 
15 pharmaceutical^ acceptable carrier. By "pharmaceutically acceptable" is meant a material that 
is not biologically or otherwise undesirable, i.e., the material can be administered to a subject, 
along with the nucleic acid or vector, without causing any undesirable biological effects or 
interacting in a deleterious manner with any of the other components of the pharmaceutical 
composition in which it is contained. The carrier would naturally be selected to minimize any 
20 degradation of the active ingredient and to minimize any adverse side effects in the subject, as 
would be well known to one of skill in the art 

[1 82] The compositions can be administered orally, parenterally (e.g., intravenously), 
by intramuscular injection, by intraperitoneal injection, transdermally, extracorporeally, 
topically or the like, including topical intranasal administration or administration by inhalant 

25 As used herein, "topical intranasal administration" means delivery of the compositions into the 
nose and nasal passages through one or both of the nares and can comprise delivery by a 
spraying mechanism or droplet mechanism, or through aerosolization of the nucleic acid or 
vector. Administration of the compositions by inhalant can be through the nose or mouth via 
delivery by a spraying or droplet mechanism. Delivery can also be directly to any area of the 

30 respiratory system (e.g., lungs) via intubation. The exact amount of the compositions required 
will vary from subject to subject, depending on the species, age, weight and general condition of 
the subject, the severity of the allergic disorder being treated, the particular nucleic acid or 
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vector used, its mode of administration and the like. Thus, it is not possible to specify an exact 
amount for every composition. However, an appropriate amount can be determined by one of 
ordinary skill in the art using only routine experimentation given the teachings herein. 

[183] Parenteral administration of the composition, if used, is generally characterized 
5 by injection. Injectables can be prepared in conventional forms, either as liquid solutions or 
suspensions, solid forms suitable for solution of suspension in liquid prior to injection, or as 
emulsions. A more recently revised approach for parenteral administration involves use of a 
slow release or sustained release system such that a constant dosage is maintained. See, e.g., 
U.S. Patent No. 3,610,795, which is incorporated by reference herein. 

10 [ 1 84] The materials can be in solution, suspension (for example, incorporated into 

microparticles, liposomes, or cells). These can be targeted to a particular cell type via 
antibodies, receptors, or receptor ligands. The following references are examples of the use of 
this technology to target specific proteins to tumor tissue (Senter, et al., Bioconjugate Chem., 
2:447-451, (1991); Bagshawe, K.D., Br. J. Cancer, 60:275-281, (1989); Bagshawe, et al., Br. J. 

15 Cancer, 58:700-703, (1988); Senter, et al., Bioconjugate Chem., 4:3-9, (1993); Battelli, et al., 
Cancer Immunol Immunother., 35:421-425, (1992); Pietersz and McKenzie, Immunolog. 
Reviews, 129:57-80, (1992); and Roffler, et al., Biochem. Pharmacol, 42:2062-2065, (1991)). 
Vehicles such as "stealth" and other antibody conjugated liposomes (including lipid mediated 
drug targeting to colonic carcinoma), receptor mediated targeting of DNA through cell specific 

20 ligands, lymphocyte directed tumor targeting, and highly specific therapeutic retroviral targeting 
of murine glioma cells in vivo. The following references are examples of the use of this 
technology to target specific proteins to tumor tissue (Hughes et al., Cancer Research, 49:6214- 
6220, (1989); and Litzinger and Huang, Biochimica et Biophysica Acta, 1104:179-187, (1992)). 
In general, receptors are involved in pathways of endocytosis, either constitutive or ligand 

25 induced. These receptors cluster in clathrin-coated pits, enter the cell via clathrin-coated 

vesicles, pass through an acidified endosome in which the receptors are sorted, and then either 
recycle to the cell surface, become stored intracellularly, or are degraded in lysosomes. The 
internalization pathways serve a variety of functions, such as nutrient uptake, removal of 
activated proteins, clearance of macromolecules, opportunistic entry of viruses and toxins, 

30 dissociation and degradation of ligand, and receptor-level regulation. Many receptors follow 
more than one intracellular pathway, depending on the cell type, receptor concentration, type of 
ligand, ligand valency, and ligand concentration. Molecular and cellular mechanisms of 
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receptor-mediated endocytosis has been reviewed (Brown and Greene, DNA and Cell Biology 
10:6,399^09(1991)). 

a) Pharmaceutical^ Acceptable Carriers 

[185] The compositions, including antibodies, can be used therapeutically in 
5 combination with a pharmaceutical^ acceptable carrier. 

[186] Suitable carriers and their formulations are described in Remington: The Science 
and Practice of Pharmacy (19thed.)ed. A.R. Gennaro, Mack Publishing Company, Easton, 
PA 1995. Typically, an appropriate amount of a phannaceutically-acceptable salt is used in the 
formulation to render the formulation isotonic. Examples of the phannaceutically-acceptable 

10 carrier include, but are not limited to, saline, Ringer's solution and dextrose solution. The pH of 
the solution is preferably from about 5 to about 8, and more preferably from about 7 to about 
7.5. Further carriers include sustained release preparations such as semipermeable matrices of 
solid hydrophobic polymers containing the antibody, which matrices are in the form of shaped 
articles, e.g., films, liposomes or microparticles. It will be apparent to those persons skilled in 

15 the art that certain carriers can be more preferable depending upon, for instance, the route of 
administration and concentration of composition being administered. 

[1 87] Pharmaceutical carriers are known to those skilled in the art. These most 
typically would be standard carriers for administration of drugs to humans, including solutions 
such as sterile water, saline, and buffered solutions at physiological pH. The compositions can 
20 be administered intramuscularly or subcutaneously. Other compounds will be administered 
according to standard procedures used by those skilled in the art. 

[1 88] Pharmaceutical compositions can include carriers, thickeners, diluents, buffers, 
preservatives, surface active agents and the like in addition to the molecule of choice. 
Pharmaceutical compositions can also include one or more active ingredients such as antimicrobial 
25 agents, antiinflammatory agents, anesthetics, and the like. 

[189] The pharmaceutical composition can be administered in a number of ways 
depending on whether local or systemic treatment is desired, and on the area to be treated. 
Administration can be topically (including ophthalmicaUy, vaginally, rectally, intranasally), orally, 
by inhalation, or parenterally, for example by intravenous drip, subcutaneous, intraperitoneal or 
30 intramuscular injection. The disclosed antibodies can be administered intravenously, 
intraperitoneally, intramuscularly, subcutaneously, intracavity, or transdennally. 
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[190] Preparations for parenteral administration include sterile aqueous or non-aqueous 
solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl 
oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions, 
5 including saline and buffered media. Parenteral vehicles include sodium chloride solution, 
Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's, or fixed oils. Intravenous 
vehicles include fluid and nutrient replenishes, electrolyte replenishes (such as those based on 
Ringer's dextrose), and the like. Preservatives and other additives can also be present such as, 
for example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the like. 

10 [191] Formulations for topical administration can include ointments, lotions, creams, 

gels, drops, suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, 
aqueous, powder or oily bases, thickeners and the like may be necessary or desirable. 

[192] Compositions for oral administration include powders or granules, suspensions or 
solutions in water or non-aqueous media, capsules, sachets, or tablets. Thickeners, flavorings, 
1 5 diluents, emulsifiers, dispersing aids or binders may be desirable. 

[193] Some of the compositions can potentially be administered as a pharmaceutically 
acceptable acid- or base- addition salt, formed by reaction with inorganic acids such as 
hydrochloric acid, hydrobromic acid, perchloric acid, nitric acid, thiocyanic acid, sulfuric acid, 
and phosphoric acid, and organic acids such as formic acid, acetic acid, propionic acid, glycolic 
20 acid, lactic acid, pyruvic acid, oxalic acid, malonic acid, succinic acid, maleic acid, and fumaric 
acid, or by reaction with an inorganic base such as sodium hydroxide, ammonium hydroxide, 
potassium hydroxide, and organic bases such as mono-, di-, trialkyl and aryl amines and 
substituted ethanolamines. 

10. Chips and micro arrays 

25 [1 94] Disclosed are chips where at least one address is the sequences or part of the 

sequences set forth in any of the nucleic acid sequences disclosed herein. Also disclosed are 
chips where at least one address is the sequences or portion of sequences set forth in any of the 
peptide sequences disclosed herein. 

[195] Also disclosed are chips where at least one address is a variant of the sequences 
30 or part of the sequences set forth in any of the nucleic acid sequences disclosed herein. Also 
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disclosed are chips where at least one address is a variant of the sequences or portion of 
sequences set forth in any of the peptide sequences disclosed herein. 

11. Computer readable mediums 

[1 96] It is understood that the disclosed nucleic acids and proteins can be represented 
5 as a sequence consisting of the nucleotides of amino acids. There are a variety of ways to 
display these sequences, for example the nucleotide guanosine can be represented by G or g. 
Likewise the amino acid valine can be represented by Val or V. Those of skill in the art 
understand how to display and express any nucleic acid or protein sequence in any of the variety 
of ways that exist, each of which is considered herein disclosed. Specifically contemplated 

10 herein is the display of these sequences on computer readable mediums, such as, commercially 
available floppy disks, tapes, chips, hard drives, compact disks, and video disks, or other 
computer readable mediums. Also disclosed are the binary code representations of the disclosed 
sequences. Those of skill in the art understand what computer readable mediums. Thus, 
computer readable mediums on which the nucleic acids or protein sequences are recorded, 

15 stored, or saved. 

[197] Disclosed are computer readable mediums comprising the sequences and 
information regarding the sequences set forth herein. 

12. Kits 

[198] Disclosed herein are kits that are drawn to reagents that can be used in practicing 
20 the methods disclosed herein. The kits can include any reagent or combination of reagent 

discussed herein or that would be understood to be required or beneficial in the practice of the 
disclosed methods. For example, the kits could include primers to perform the amplification 
reactions discussed in certain embodiments of the methods, as well as the buffers and enzymes 
required to use the primers as intended. 
25 D. Methods of making the compositions 

[1 99] The compositions disclosed herein and the compositions necessary to perform 
the disclosed methods can be made using any method known to those of skill in the art for that 
particular reagent or compound unless otherwise specifically noted. 

[200] The disclosed viral vectors can be made using standard recombinant molecular 
30 biology techniques. Many of these techniques are illustrated in Maniatis (Maniatis et al., 

"Molecular Cloning-A Laboratory Manual," (Cold Spring Harbor Laboratory, Latest edition) 
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and Sambrook et al, Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York, 1989. 

1. Nucleic acid synthesis 

[201] For example, the nucleic acids, such as, the oligonucleotides to be used as 
primers can be made using standard chemical synthesis methods or can be produced using 
enzymatic methods or any other known method. Such methods can range from standard 
enzymatic digestion followed by nucleotide fragment isolation (see for example, Sambrook et 
al, Molecular Cloning: A Laboratory Manual, 2nd Edition (Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y., 1989) Chapters 5, 6) to purely synthetic methods, for example, 
by the cyanoethyl phosphoramidite method using a Milligen or Beckman System Plus DNA 
synthesizer (for example, Model 8700 automated synthesizer of Milligen-Biosearch, Burlington, 
MA or ABI Model 380B). Synthetic methods useful for making oligonucleotides are also 
described by Heutaef al, Ann. Rev. Biochenu 53:323-356 (1984), (phosphotriester and 
phosphite-triester methods), and Narang et al 9 Methods EnzymoL, 65:610-620 (1980), 
(phosphotriester method). Protein nucleic acid molecules can be made using known methods 
such as those described by Nielsen et al, Bioconjug. Chem. 5:3-7 (1994). 

2. Peptide synthesis 

[202] One method of producing the disclosed proteins is to link two or more peptides 
or polypeptides together by protein chemistry techniques. For example, peptides or 
polypeptides can be chemically synthesized using currently available laboratory equipment 
using either Fmoc (9-fluorenylmethyloxycarbonyl) or Boc (tert -butyloxycarbonoyl) chemistry. 
(Applied Biosystems, Inc., Foster City, CA). One skilled in the art can readily appreciate that a 
peptide or polypeptide corresponding to the disclosed proteins, for example, can be synthesized 
by standard chemical reactions. For example, a peptide or polypeptide can be synthesized and 
not cleaved from its synthesis resin whereas the other fragment of a peptide or protein can be 
synthesized and subsequently cleaved from the resin, thereby exposing a terminal group which 
is functionally blocked on the other fragment By peptide condensation reactions, these two 
fragments can be covalently joined via a peptide bond at their carboxyl and amino termini, 
respectively, to form an antibody, or fragment thereof. (Grant GA (1992) Synthetic Peptides: A 
User Guide. W.H. Freeman and Co., N.Y. (1992);BodanskyMandTrostB.,Ed. (1993) 
Principles of Peptide Synthesis. Springer-Verlag Inc., NY (which is herein incorporated by 
reference at least for material related to peptide synthesis). Alternatively, the peptide or 
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polypeptide is independently synthesized in vivo as described herein. Once isolated, these 
independent peptides or polypeptides can be linked to form a peptide or fragment thereof via 
similar peptide condensation reactions. 

[203] For example, enzymatic ligation of cloned or synthetic peptide segments allow 
5 relatively short peptide fragments to be joined to produce larger peptide fragments, polypeptides 
or whole protein domains (Abrahmsen L et al., Biochemistry, 30:4151 (1991)). Alternatively, 
native chemical ligation of synthetic peptides can be utilized to synthetically construct large 
peptides or polypeptides from shorter peptide fragments. This method consists of a two step 
chemical reaction (Dawson et al. Synthesis of Proteins by Native Chemical Ligation. Science, 
10 266:776-779 (1994)). The first step is the chemoselective reaction of an unprotected synthetic 
peptide-thioester with another unprotected peptide segment containing an ammo-terminal Cys 
residue to give a thioester-linked intermediate as the initial covalent product Without a change 
in the reaction conditions, this intermediate undergoes spontaneous, rapid intramolecular 
reaction to form a native peptide bond at the ligation site (Baggiolini M et al. (1 992) FEBSLett. 
15 307:97-101 ; Clark-Lewis I et al., J.BioLChem. 9 269:16075 (1994); Clark-Lewis I et al., 
Biochemistry, 30:3128 (1991); Rajarathnam K et al., Biochemistry 33:6623-30 (1994)). 

[204] Alternatively, unprotected peptide segments are chemically linked where the 
bond formed between the peptide segments as a result of the chemical ligation is an unnatural 
(non-peptide) bond (Schnolzer, M et al. Science, 256:221 (1992)). This technique has been 
20 used to synthesize analogs of protein domains as well as large amounts of relatively pure 

proteins with full biological activity (deLisle Milton RC et al., Techniques in Protein Chemistry 
IV. Academic Press, New York, pp. 257-267(1992)). 

3. Processes for making the compositions 

[205] Disclosed are processes for making the compositions as well as ma k ing the 
25 intermediates leading to the compositions. There are a variety of methods that can be used for 
making these compositions, such as synthetic chemical methods and standard molecular biology 
methods. It is understood that the methods of making these and the other disclosed 
compositions are specifically disclosed. 

[206] Disclosed are nucleic acid molecules produced by the process comprising linking 
30 in an operative way a promoter element and a fi-opioid receptor element. 
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[207] Disclosed are nucleic acid molecules produced by the process comprising linking 
in an operative way nucleic acid molecules comprising sequences set forth in SEQ ID NO:2 and 
SEQ ID NO:4. 

[208] Also disclosed are nucleic acid molecules produced by the process comprising 
5 linking in an operative way nucleic acid molecules comprising sequences having 80% identity 
to sequences set forth in SEQ ID NO:2 and SEQ ID NO:4. 

[209] Also disclosed are nucleic acid molecules produced by the process comprising 
linking in an operative way nucleic acid molecules comprising sequences that hybridizes under 
stringent hybridization conditions to sequences set forth in SEQ ID NO:2 and SEQ ID NO:4. 

10 [210] Disclosed are nucleic acid molecules produced by the process comprising linking 

in an operative way a nucleic acid molecule comprising a sequence encoding a ^.-opioid receptor 
peptide and a sequence controlling an expression of the sequence encoding the ji-opioid receptor 
peptide. 

[211] Disclosed are nucleic acid molecules produced by the process comprising linking 
15 in an operative way a nucleic acid molecule comprising a sequence encoding a fi-opioid receptor 
peptide wherein the ji-opioid receptor peptide has 80% identity to the peptide set forth in SEQ 
ID NO:l or 3 and a sequence controlling expression of the sequences encoding the peptide. 

[212] Disclosed are nucleic acid molecules produced by the process comprising linking 
in an operative way a nucleic acid molecule comprising a sequence encoding a ^-opioid receptor 
20 peptide wherein the fi-opioid receptor peptide has 80% identity to the peptides set forth in SEQ 
ID NO: 1 or 3 and, wherein any change from the sequences set forth in SEQ ID NO: 1 or 3 are 
conservative changes and a sequence controlling expression of the sequences encoding the 
peptide. 

[213] Disclosed are cells produced by the process of transforming the cell with any of 
25 the disclosed nucleic acids. Disclosed are cells produced by the process of transforming the cell 
with any of the non-naturally occurring disclosed nucleic acids. 

[2 1 4] Disclosed are any of the disclosed peptides produced by the process of 
expressing any of the disclosed nucleic acids. Disclosed are any of the non-naturally occurring 
disclosed peptides produced by the process of expressing any of the disclosed nucleic acids. 
30 Disclosed are any of the disclosed peptides produced by the process of expressing any of the 
non-naturally disclosed nucleic acids. 
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[2 1 5] Disclosed are animals produced by the process of transfecting a cell within the 
animal with any of the nucleic acid molecules disclosed herein. Disclosed are animals produced 
by the process of transfecting a cell within the animal any of the nucleic acid molecules 
disclosed herein, wherein the animal is a mammal. Also disclosed are animals produced by the 
5 process of transfecting a cell within the animal any of the nucleic acid molecules disclosed 
herein, wherein the mammal is mouse, rat, rabbit, cow, sheep, pig, or primate. Also disclosed 
are non human primates and non-human mammals. 

[216] Also disclose are animals produced by the process of adding to the animal any of 
the cells disclosed herein. 
10 E. Methods of using the compositions 

1. Methods of using the compositions as research tools 

[217] The disclosed compositions can be used in a variety of ways as research tools. 
For example, the disclosed compositions, the (i-opioid receptor constructs, and other nucleic 
acids, such as SEQ ID NOs:2 and 4 can be used to produce organisms, such as transgenic or 
15 knockout mice, which can be used as model systems for the study of pain. 

2. Methods of gene modification and gene disruption 

[218] The disclosed compositions and methods can be used for targeted gene 
disruption and modification in any animal that can undergo these events. Gene modification 
and gene disruption refer to the methods, techniques, and compositions that surround the 

20 selective removal or alteration of a gene or stretch of chromosome in an animal, such as a 
mammal, in a way that propagates the modification through the germ line of the mammal. In 
general, a cell is transformed with a vector which is designed to homologously recombine with a 
region of a particular chromosome contained within the cell, as for example, described herein. 
This homologous recombination event can produce a chromosome which has exogenous DNA 

25 introduced, for example in frame, with the surrounding DNA. This type of protocol allows for 
very specific mutations, such as point mutations, to be introduced into the genome contained 
within the cell. Methods for performing this type of homologous recombination are disclosed 
herein. 

[219] One of the preferred characteristics of performing homologous recombination in 
30 mammalian cells is that the cells should be able to be cultured, because the desired 
recombination event occurs at a low frequency. 
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[220] Once the cell is produced through the methods described herein, an animal can 
be produced from this cell through either stem cell technology or cloning technology. For 
example, if the cell into which the nucleic acid was transfected was a stem cell for the organism, 
then this cell, after transfection and culturing, can be used to produce an organism which will 
5 contain the gene modification or disruption in germ line cells, which can then in turn be used to 
produce another animal that possesses the gene modification or disruption in all of its cells. In 
other methods for production of an animal containing the gene modification or disruption in all 
of its cells, cloning technologies can be used. These technologies generally take the nucleus of 
the transfected cell and either through fusion or replacement fiise the transfected nucleus with an 
10 oocyte which can then be manipulated to produce an animal. The advantage of procedures that 
use cloning instead of ES technology is that cells other than ES cells can be transfected. For 
example, a fibroblast cell, which is very easy to culture can be used as the cell which is 
transfected and has a gene modification or disruption event take place, and then cells derived 
from this cell can be used to clone a whole animal. 

15 3. Therapeutic Uses 

[221] Effective dosages and schedules for administering the compositions can be 
determined empirically, and making such determinations is within the skill in the art The 
dosage ranges for the administration of the compositions are those large enough to produce the 
desired effect in which the symptoms disorder are effected. The dosage should not be so large 

20 as to cause adverse side effects, such as unwanted cross-reactions, anaphylactic reactions, and 
the like. Generally, the dosage will vary with the age, condition, sex and extent of the disease in 
the patient, route of administration, or whether other drugs are included in the regimen, and can 
be determined by one of skill in the art The dosage can be adjusted by the individual physician 
in the event of any counterindications. Dosage can vary, and can be administered in one or 

25 more dose administrations daily, for one or several days. Guidance can be found in the 
literature for appropriate dosages for given classes of pharmaceutical products. 

[222] Following administration of a disclosed composition, such as the disclosed 
constructs, for treating, inhibiting, or preventing pain, the efficacy of the therapeutic construct 
can be assessed in various ways well known to the skilled practitioner. For instance, one of 
30 ordinary skill in the art will understand that a composition, such as the disclosed constructs, 
disclosed herein is efficacious in treating pain or inhibiting or reducing the effects of pain in a 
subject by observing that the composition reduces the onset of the conditions associated with 
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these diseases. Furthermore, the amount of protein or transcript produced from the constructs 
can be analyzed using any diagnostic method. For example, it can be measured using 
polymerase chain reaction assays to detect the presence of construct nucleic acid or antibody 
assays to detect the presence of protein produced from the construct in a sample (e.g., but not 
5 limited to, blood or other cells, such as neural cells) from a subject or patient, or it can be 

measured by any of the methods disclosed herein for monitoring non-human pain, and through 
communication for human pain. 

[223] 

F. Examples 

1 0 [224] It will be apparent to those skilled in the art that various modifications and 

variations can be made in the present invention without departing from the scope or spirit of the 
invention. Other embodiments of the invention will be apparent to those skilled in the art from 
consideration of the specification and practice of the invention disclosed herein. It is intended 
that the specification and examples be considered as exemplary only, with a true scope and 

15 spirit of the invention being indicated by the following claims. 

[225] The following examples are put forth so as to provide those of ordinary skill in 
the art with a complete disclosure and description of how the compounds, compositions, 
articles, devices and/or methods claimed herein are made and evaluated, and are intended to be 
purely exemplary of the invention and are not intended to limit the scope of what the inventors 
20 regard as their invention. Efforts have been made to ensure accuracy with respect to numbers 
(e.g., amounts, temperature, etc.), but some errors and deviations should be accounted for. 
Unless indicated otherwise, parts are parts by weight, temperature is in °C or is at ambient 
temperature, and pressure is at or near atmospheric. 

1. Example 1 Orofacial pain 

25 a) Methods and approaches 

\226] Expression of the |>opioid receptor in neurons involved in the central processing 
of orofacial pain will result in attenuation of nociception. FIV(n-opioid receptor), a lentivirus 
capable of stably transducing terminally differentiated cells (neurons) with ji-opioid receptor, 
can be peripherally administered at orofacial sites, including the TMJ and the masseter muscle, 
30 exposed to nociceptive substances. The efficacy of the therapy can be assessed by assaying 
changes in muscle EMG as well as resistance to jaw opening. 
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(1) Construct FIV(HUMOR) and confirm its ability to 
transduce cells of neuronal origin 

[227] Thfe opening reading frame of the HUMOR cDNA having the sequence set forth 
in SEQ ID NO:2 was (Raynor K, et al., J Pharmacol Exp 7*erJ995; 272:423-8) cloned into 
5 the pFIV vector using standard molecular biology methods generating pFIV(HUMOR) SEQID 
NO: 7 (See Figure 4). Constructs having two different promoters were made to drive the 
expression of HUMOR, the neuron specific enolase (NSE) promoter as well as the 
cytomegalovirus (CMV) promoter. The NSE promoter limits the expression of HUMOR 
selectively in neurons at constitutive levels, whereas the CMV promoter results in ubiquitous 

10 levels of expression independent of cell type. Clones were selected following DNA isolation 
and confirmation by multiple restriction digestions and direct DNA sequencing of both strands. 
pFIV(HUMOR) can be co-transfected with pPAK and pVSV-G vectors in 293H cells utilizing 
the Lipofectamine 2000® reagent (Gibco/BRL-Invitrogen) for virus production as previously 
described (Figure 3). Following 60 hours incubation, the supernatant will be collected, filtered 

15 and titered. Titers of 10 5 ip/ml are typical using this method. If required, the titers can be 
increased by ultra-centrifiigation of the supernatant and re-suspension in normal saline. 

[228] FIV(NSE-HUMOR) as well as FIV(CMV-HUMOR) efficacy can be tested on 
the N2a neuronal cell line, whereby cells can be seeded on 12 well plates and infected by 
FIV(HUMOR), followed by a 24 hour fresh media change. DNA, RNA and protein samples 

20 can be harvested using standard laboratory methods at 72 hours post infection. The presence of 
HUMOR gene copies in the infected samples can be assessed by Q-PCR, and the expression of 
HUMOR can be determined at the mRNA and protein levels by RT-PCR and western 
immunoblotting, respectively. In addition, other cells can be fixed by 4% paraformaldehyde for 
HUMOR protein expression detection and visualization by immunocytochemistry. Figure 1 or 

25 2 demonstrates the expression of HUMOR in N2a cells following transient transfection and 

detection of its expression by immunocytochemistry utilizing a commercially available antibody 
raised against HUMOR (Research & Diagnostic Antibodies, Benicia CA Cat# AS-3942S). The 
FIV vector can infect and stably transduce sensory trigeminal neurons following local peripheral 
administration as demonstrated by the data in Figure 3. At the animal behavioral level, the 

30 targeted expression of HUMOR in orofacial sensory neurons can result in attenuation of 

nociceptive symptoms that readily can be measured as change (decrease) in EMG activity and 
decrease in resistance to jaw opening. 
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(2) Targeted expression of jx-opioid receptor in orofacial 
somatosensory neurons can attenuate orofacial pain following 
administration of algesic substance 

[229] The feline immunodeficiency viral vectors FIV(NSE-HUMOR) and FIV(CMV- 
5 HUMOR) can be employed in the transduction of trigeminal sensory neurons with the |i-opioid 
receptor. Orofacial nociception can be quantitatively assessed by means of masticatory muscle 
activity, as measured by electromyography (EMG), as well as resistance to mouth opening, as 
measured by a digital force transducer. Table 6 summarizes the experimental conditions. In 
brief, FIV(NSE-HUMOR) and FIV(CMV-HUMOR) can be injected (50 \il of 10 7 ip/md) in the 

10 TMJ or masseter muscle of anesthetized animals, which can be returned to their cages after 
recovery. Five weeks post-treatment, EMG bipolar percutaneous hook electrodes can be 
inserted in the masseter and temporal muscles under anesthesia and an orthodontic Kobayashi 
hook can be bonded to the mandibular incisors for attachment to the digital force gauge. The 
animals can then be positioned in a custom made restraining device. Base line EMG and 

15 resistance to jaw opening measurements can be taken for every animal. Four channels of 

simultaneous EMG signal (right and left masseter and temporal muscles) and one channel for 
the digital force gauge can be recorded. An A/D conversion card (NI016E1, National 
Instruments) for the EMG signal and a digital force gauge (FGF series, Kernco Instruments) can 
be used for signal collection. A "blinded" observer can collect 10 seconds of EMG and digital 

20 force measurements at 5, 10 and 15 mm of vertical opening five times simultaneously. Data can 
be stored for later analysis. The appropriate animals can then be injected in the 
temporomandibular joint and masseter muscle area with the algesic agent glutamate (2.5 \xmol 
in saline) or vehicle solution (saline), and EMG and resistance to jaw opening recordings can be 
taken. Data can be analyzed by two-way ANOVA with repeated measures. Student's t-test can 

25 compare the two groups. These experiments replicate conditions seen in the human condition. 
(Molin C. 9 Acta Odontol Scand 1972; 30:485-99; MollerUM, et al., Scand J Dent Res 1984; 92: 
64-83; StoWer CS, et al., Helv Odontol Acta 1985; 29:13-20; Stohler CS, et al., Arch Oral Biol 
1988; 33: 175-82; Stohler CS. Temporomandibular disorders and related pain conditions. In: 
Sessle, B. J., Bryant, P.S. and Dionne, R.A. Editors, 1995. Progress in pain research and 

30 management Elsevier, Amsterdam, pp. 3-30; Lund JP, Stohler CS. Effect of pain on muscular 
activity in temporomandibular disorders and related conditions. In: Stohler, C.S., Carlson, D.S., 
editors. Biological and Psychological aspects of orofacial pain, craniomandibular growth series. 
Ann Arbor, MI: University of Michigan, 1994. pp. 75-91; HoshioT. Senior Thesis, Division 
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of Orthodontics, Eastman Department of Dentistry, University of Rochester School of Medicine 
and Dentistry. Rochester, New York; Balkhi KM, et aL, J Orofacial Pain 1993;7:89-97). 

[230] HUMOR expression can he confirmed following the experimental recordings in 
deeply anesthetized animals that can be sacrificed via trans-cardial perfusion of 4% 

5 paraformaldehyde, and the brain stem, trigeminal ganglia, masseter muscle and TMJ can be 
excised and collected for further analysis. Specifically, we can characterize the expression of 
HUMOR in the central and peripheral tissues utilizingjimmunocytochemistry as depicted by our 
preliminary data. Cell type specificity can be accomplished by double immunofluorescence, as 
previously demonstrated by our laboratory. Potential deleterious effects of HUMOR expression 

10 (or over-expression in the CMV driven gene) to neurons or other cells can be examined by 
Hoechst staining and confirmed by TUNEL (in the case of cell death) as well as by the 
expression of neuron specific housekeeping genes, such as neuron specific enolase as well 
substance P (expressed selectively by sensory neurons). 



G. TABLE 6 Summary of experimental conditions 



ALGESIC 
AGENT 


SITE of 
NOCICEPTION 


TREATMENT 


N 


Glutamate 
saline 


Temporomandibular 
Joint 
Masseter muscle 


FIV(NSE-HUMOR) 

FIV(CMV-HUMOR) 
morphine 
FIV(NSE-HUMOR)+opioid 
FIV(CMV-HUMOR)+opioid 
saline 


10 
each 



15 [23 1 ] Patients presenting with orofacial pain from TMJ disorders have been 

characterized as having decreased bite and chewing forces and limited jaw opening . For 
example, reduced grip strength (Norsdenskiold UM and Grimby G., Scand J Rheumatol 1993; 
22: 14-9.), sustained jaw closing pain (Jow RW and Clark GT., Arch Oral Biol 1989; 34: 857- 
62.) and reduced bite strength (Molin C, Acta Odontol Scand 1972; 30:485-99) have all been 

20 reported in patients with muscle pain. The reduction in muscle force exertion associated with 
myalgia has been suggested to be due to reduced activity of agonist muscles and increased 
activity of antagonist muscles (Molin C, Acta Odontol Scand 1 972; 30:485-99; Stohler CS, et 
al.,/fe/v Odontol Acta 1985; 29:13-20; Stohler CS, etaL, Arch Oral Biol 1988; 33: 175-82; 
Stohler CS. Temporomandibular disorders and related pain conditions. In: Sessle, B.J., Bryant, 

25 P.S. and Dionne, R.A. Editors, 1995. Progress in pain research and management Elsevier, 
Amsterdam, pp. 3-30; and Lund JP, Stohler CS. Effect of pain on muscular activity in 
temporomandibular disorders and related conditions. Li: Stohler, C.S., Carlson, D.S., editors. 



WO 2004/073646 



PCT/US2004/004914 



Biological and Psychological aspects of orofacial pain, craniomandibular growth series. Ann 
Arbor, MI: University of Michigan, 1994. pp. 75-91.) Keil et al., (Keil U, et al., Pain 2000; 
85:333-43) have demonstrated that foreliinb grip force reduction is a behavioral index of 
hyperalgesia in the carrageenan model of muscle hyperalgesia. This would translate to 
5 reduction of bite force and an increase in antagonist muscle activity in the orofacial region. 
These experiments in have been done in humans (Lund JP, Stohler CS. Effect of pain on 
muscular activity in temporomandibular disorders and related conditions. In: Stohler, C.S., 
Carlson, D.S., editors. Biological and Psychological aspects of orofacial pain, craniomandibular 
growtli series. Ann Arbor, MI: University of Michigan, 1994. pp. 75-91; HoshioT. Senior 

1 0 Thesis, Division of Orthodontics, Eastman Department of Dentistry, University of Rochester 
School of Medicine and Dentistry. Rochester, New York; and Balkhi KM, et al., J Orofacial 
Pain 1993;7:89-97.). Hoshio has also demonstrated that compared to controls, patients with 
TMD demonstrate decreased bite (201 and 223 mV for the masseter muscles respectively in 
asymptomatic volunteers and 128 and 153 mV for symptomatic patients). Balkhi (Balkhi KM, 

15 et al., J Orofacial Pain 1993;7:89-97) demonstrated that chewing force was decreased in patients 
with pain (113 and 102 mV for deliberate right and left side chewing of gum masseter muscles 
respectively in asymptomatic volunteers and 85 and 83 mV for symptomatic patients). It has 
been demonstrated that there was increased jaw muscle activity of antagonists during jaw 
opening.. These data clearly demonstrate that patients with TMD have similar characteristics 

20 and that decreased bite force and chewing activity are a reflection of somatic pain. 

[232] Mice (C57/B6) can be employed to make the transgenic mice. 

a) Injection of replication defective HV(HUMOR) vectors 

[233] The animals, such as pups, can be anesthetized with ketamine (60 mg/Kg) and 
xylazine (5 mg/Kg) IP. To verify the induction of surgical anesthesia, a toe is pinched in order 

25 to test for reflex withdrawal. One type of injection will be performed in two distinct areas. 
Under surgical plane of anesthesia the animals, such as mice, can be injected with 50jil of 10 7 
ip/ml of FIV(HUMOR) using a 1ml syringe with a 21 m gage needle directly into the 
temporomandibular joint and masseter muscle. The animals can be identified by ear punching. 
Animals can be held at the base of the tail with distal portion of tail situated on surface of 

30 nestlet, for example. Using a straight edge blade, -7mm of distal tail can be removed, and the 
mouse can be placed in a cage and the tail specimen stored in a vial labeled by mouse ID# and 
sex. The mice are euthanised with sodium pentobarbital (200 mg/kg). 
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[234] Fixation by intracardial transfusion can be performed. Upon exposure of the 
heart, the right atrium can be clipped and the left ventricle can be catheterized with a 17 gage 
needle through which 50ml of 4% paraformaldehyde solution in phosphate buffered saline can 
be transfused into the animal. The liver, spleen, kidney and brain can be dissected and post- 
5 fixed until sectioned for histology. The middle part of the cranium, including the cranial base 
(sphenoid, ethmoid, maxilla) can also be dissected, demineralized by immersion into an EDTA 
solution and section for histology. 

2. Example 2: Non-Primate Lentiviral Vector Administration in the TMJ 

[235] Diclosed herein are the effects lentiviral vectors on the temporomandibular joint 
1 0 Defective feline immunodeficiency virus capable of infecting dividing as well as terminally 

differentiated cells with the reporter gene lacZ, the expression of which was studied by means of 
PCR, X-gal histochemistry and 0-galactosidase immunocytochemistry were injected into the 
articular joint space. The results showed successful transduction of hard and soft tissues of the 
temporomandibular joint Interestingly, a subset of primary sensory neurons of the ipsilateral 
1 5 trigeminal ganglion also stained positive for the reporter gene, presumambly following uptake 
of the lentiviral vector by peripheral nerve fibers and retrograde transport to the nucleus. These 
findings indicate that transfer of anti-nociceptive genes, and disclosed herein, genes such as t he 
opiod receptors, can be transferred into nerve cells and relieve pain. For example, lentiviral 
vectors can serve as the platform for the transfer of anti-nociceptive genes for the management 
20 of temporomandibular joint pain. 

a) Materials and Methods 

(1) Vector Construction and Packaging 

[236] The defective, vesicular stomatitis (VSV-G) pseudotyped, feline 
immunodeficiency virus, FIV(lacZ), capable of transducing dividing, growth-arrested as well as 

25 post-mitotic cells (neurons) with the reporter gene lacZ driven by the ubiquitous 

cytomegalovirus promoter, CMV (Poeschla EM, et al. (1998). Nature Med 4: 354-357) was 
employed. The vectors were kindly donated to us by Dr. Wong-Staal, University of California at 
San Diego. A schematic description of the vector is depicted in Figure 6A. In addition, a 
control FIVfA'lac) vector carrying an inactive p-galactosidase was constructed by deleting the 

30 first 1,000 bp of the focZgene (3.75 kb in total), including the transcription initiation site (Fig. 
6A). Specifically, the FIV(lacZ) vector was digested in vitro with the Sstn and Cla I restriction 
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enzymes overnight at 37°C, followed by agarose gel purification. The ends of the backbone 
DNA were blunted with the T4 DNA polymerase (Invitrogen, Carlsbad CA) and ligated with T4 
ligase (Invitrogen) according to manufacturer's instructions. The FIV(lacZ) and FTV(A'lac) 
vectors were transiently co-transfected along with the packaging and VSV-G vectors into 293H 
5 cells (GBCO/BRL) cultured in DMEM (Invitrogen) plus 1 0% FBS (Gemini, Woodland CA) 
using the Lipofectamine 2000 reagent per manufacturer's instructions (Invitrogen), and followed 
by a fresh media change supplemented by non-essential amino acids (Invitrogen). Sixty hours 
post-transfection, the supernatant was collected, filtered through .45mm Surfil®-MF filter 
(Corning Seperations Division, Acton MA), aliquoted and frozen until further use. Titering was 
10 performed on CrfK cells (American Tissue Culture Collection; Manassas, VA) cultured in 24 
well tissue culture plates, and assessed at at 5xl0 7 blue forming units (bfu) / mL by X-gal 
histochemistry. 

(2) Animal Injections 

[237] All methods pertinent to animal utilization were approved by the University 

15 Committee on Animal Resources. Specifically, 1 2 male mice, C57BL/6J, under surgical plane 
of anesthesia (ketamine 60mg/Kg and xylazine 5 mg/Kg administered intraperitoneally) 
received a single injection of 5X10 6 FIV(lacZ) infectious particles (100 \x\ of stock solution) in 
the joint space of the right TMJ. Four additional mice received a single injection of 5X10 6 
FIVXA'lac) infectious particles (100 |d of stock solution) in the joint space of the right TMJ. In 

20 brief, the hair of the skin covering the right TMJ was shaved and the skin cleaned with Betadine 
solution. The joint was approached with an antero-posterior incision between the posterior end 
of the zygomatic arch and the ear cartilage, followed by a blunt dissection to expose the 
zygomatic arch and the posterior margin of the articular emminence. The joint space was not 
exposed during this procedure. The posterior margin of the emminense was identified by 

25 palpation and a 1-ml tuberculin syringe with a 271/2 gage needle was employed to inject the 
experimental solutions in the joint. This surgically assisted intra-articular injection technique 
was utilized to minimize leakage or spreading of the injectable solution beyond the articular 
space. (Kyrkanides S, et al. (2002). J Orofac Pain 16: 229-235). In addition, 2 mice that 
received 100 pi saline injection served as controls. Forty^five days following treatment, the mice 

30 were deeply anesthetized by pentobarbital (lOOmg/Kg IP) and euthanised by transcardial 
perfusion of 4% paraformaldehyde in phosphate buffered saline (PBS) (Kyrkanides S, et al. 
(2002a). J Orofac Pain 16: 229-235, Kyrkanides S, et al. (2002). Mol Brain Res 104: 159-169). 
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The trigeminal ganglia and brain stem were dissected and sectioned at 20 jam using a freezing 
microtome. The TMJ joints were also dissected, decalcified in an EDTA buffered solution, 
embedded in paraffin and cut at 8 |im sections. All tissues were stored at -20°C until further 
processed. 

(3) X-Gal Histochemistry 

[23 8] Sections of trigeminal ganglia were processed by X-gal histochemistry and 
evaluated under light microscopy. Specifically, the sections were washed in 0. 1 5M phosphate 
buffered saline (PBS) pH 7.2 for 60 min, followed by overnight processing in a staining solution 
containing 5-bromo-4chloro-3-indolyl-(5-D-galactopyranoside (lmg/ml), potasium fenicyanide 
(3mM), potasium ferrocyanide (3mM), NP-40 (0.02%) in 0.1M PBS pH 7.2 (Invitrogen) and 
MgCl 2 (1.3mM). The tissue was then washed in PBS for 30 min, and briefly rinsed with dH 2 0. 
Considerable attention was given so that only the bacterial form of P-galactosidase was. 
detected. The slides were cover slipped with DPX mounting medium (Fluka, Neu-Ulm, 
Switzerland) and examined under a light microscope (BX51 Olympus; Tokyo, Japan). Color 
microphotographic images were 'captured in TIFF 16-bit format using a SPOTRT Color CCD 
digital camera attached onto the microscope and connected to a PC computer. 

(4) Cell Counting 

[239] The mouse ganglia (1 ,5mm X 2mm X 3mm) were sectioned sagitally on a 
freezing cryotome along their long axis into 20fim thick sections. A total of 42 sections were 
approximately produced from each ganglion, which were sequentially collected onto 3 glass 
slides, whereby each slide contained representative ganglion sections 60fim apart of each other. 
One glass slide of each ganglion was processed by X-gal histochemistry and whas employed in 
cell counting: all X-gal positive (blue) cells were counted on each tissue section on the slides. 
Since the tissue sections were 60 jim apart, counting all blue cells on a single slide gave a 
representative number of infected cells in each ganglion while avoiding overlap between 
sections and subsequently any "double counting". 

(5) Immunocytochemistry 

[240] Tissue sections from trigeminal ganglia were analyzed by immunocytochemistry 
employing a rabbit anti p-galactosidase polyclonal antibody (Chemicon INTL, Tamacula CA). 
In brie£ sections were washed in BPS for 60 min followed by a 30 min blocking step in normal 
goat serum (4% in PBS) and overnight incubation in the primary antibody solution containing 
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rabbit anti p-galactosidase polyclonal antibody (1:2,500), 0.5% Triton-X, 4% normal goat serum 
(Invitrogen), 1% bovine serum albumin (Sigma; St Louis, MO) in PBS. The next morning the 
tissue was washed in PBS for 60 min, followed by a 30 min blocking step and incubated for 90 
min in the secondary antibody solution containing a goat anti-rabbit polyclonal antibody 
5 (1:2,000), Triton-X (0.5%) and normal goat serum (0.15%) in PBS. Subsequently, the tissue 
was washed in PBS for 30 min and incubated in a avidin-biodin complex solution (ABC kit; 
Vector Laboratories, Burlingame CA), and was then washed in 0.1 M sodium acetate buffered 
solution (pH 7.4) for 30 min. The tissue was then reacted in a DAB (3,3" diaminobenzidine) - 
Nickel solution in 0.1 M sodium acetate buffered solution (pH 7.4) for 5 min, followed by a 15 
10 min wash in PBS (Kyrkanides S, et al. (2002). J Orofac Pain 1 6: 229-235, Kyrkanides S, et al. 
(2002). Mol Brain Res 104: 159-169). The glass slides were then dehydrated through multiple 
ethanol solutions, cleared through xyaline and cover-slipped using DPX permanent mounting 
medium. The tissue sections were then studied under a light microscope and microphotbgraphic 
images were captured as described above. 

15 [24 1 ] Tissue sections from the temporomadibular joints were first deparafinized by 

immersion in a series of xylines and alcohols, followed by antigen retrieval processing (95°C 
heating for 15 sec in 0.1 M Tris-HCL buffer pH 8.9) and processing employing the 
aforementioned immunocytochemical method. 

(6) Polymerase Chain Reaction (PCR) 

20 [242] The DNA from the left and right trigeminal ganglia of 8 mice (4 control and 4 

experimental) was extracted employing the Trizol reagent (Invitrogen) according to 
manufacturer's instructions. The concentration of the recovered DNA ranged between 17-50 
ng/^1, and was analyzed for the presence of viral DNA by PCR employing the following primer 
sets. Detection of FIV viral DNA (Fig.6A): 5 TTT TTC CAG TTC CGT TTA TCC (SEQ ID 

25 NO:35) and TTT ATC GCC AAT CCA CAT CT 3 ' SEQ ID NO. 36 (T A =58°C; 40 total cycles). 
Detection of active P-galactosidase gene (Fig. 6A): 5 CCC ATA GTA ACG CCA ATA GG 
(SEQ ID NO:37) and AAA TGT GAG CGA GTA ACA ACC 3 ' SEQ ID NO. 38 (T A =59.6°C; 45 
total cycles). Detection of genomic DNA was performed utilizing primers designed for the 
murine G3PDH house keeping gene: ACC ACA GTC CAT GCC ATC AC SEQ ID NO. 39 and 

30 TCC ACC ACC CTG TTG CTG TA SEQ ID NO. 40 (T A =58°C; 30 cycles). A total of 400 ng 
was used as DNA template in the PCR reactions. The PCR products were analyzed by agarose 
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gel (1% w/v) electrophoresis and the images were captured utilizing a KODAK Image Analysis 
system (Rochester NY). 

b) RESULTS 

(1) Intra-articular FIV injection resulted in tranduction of 
5 hard and soft tissues 

[243] FIV(lacZ) injection into the TMJ articular space resulted in transfer of the 
reporter gene lacZ via the lentiviral vector in cells located within the articular capsule. 
Specifically, cells of the TMJ meniscus, presumably fibroblasts, expressed bacterial p- 
galactosidase as it was assessed by immunocytochemistry employing appropriate polyclonal 

10 antibodies- In addition, cells located in the hypertrophic zone of the condyle, primarily 

comprised of cartilaginous cells, as well as perivascular cells, including endothelial cells and 
possibly ostepcytes, also stained positive for bacterial P-galactosidase (Fig. 5). There was lack 
of P-galactosidase in the contralateral joints as well as the saline injected animals. These results 
indicate FIV successfully infected and stably transferred the reporter gene to cells of hard and 

15 soft TMJ tissues. 

(2) FIV injection into the TMJ resulted in transduction of 
trigeminal neurons 

[244] Two FIV vectors were employed in our experiment: the wild type FIV(lacZ) and 
the mutated FTV(A'Iac) (Fig. 6A). FIV(A*lac) is capable of transducing cells with an inactive 

20 form of the reporter gene P-galactosidase compared to FIV(lacZ) which carries a full-length 
lacZ (Fig. 6B & 6C). Injection of either FIV vector in the right TMJ of mice resulted in 
transduction of neurons located in the ipsilateral trigeminal ganglia as assessed by PCR (Fig. 
7A). Full length lacZ gene was detected by PCR only in the FIV(lacZ) treated animals (Fig. 7B), 
accompanied by neuronal P-galactosidase expression as assessed by X-gal histochemistry. The 

25 X-gal staining was localized primarily in the posterolateral part of the ganglion within the cell 
bodies of cells that appear histologically as neurons (Fig. 8A & 8B). In fact, the cell bodies of 
the primary sensory neurons that innervate the TMJ are known to localize in this part of the 
trigeminal ganglion. In contrast, FIV(A'lac) injected mice did not display any X-gal positive 
cells in the ganglia (Fig. 8C). Expression of bacterial P-galactosidase in the trigeminal ganglia 

30 was also confirmed by immunocytochemistry in the FIV(lacZ) (Fig. 8D) but not the FrV(A'lac) 
treated mice (Fig. 8E). Moreover, analysis of sections from the brain stem did not reveal any X- 
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gal positive staining, as it was anticipated since the vectors are defective, do not replicate and 
cannot infect second order neurons. 

(3) Considerable number of trigeminal neurons were infected 
byFIV(lacZ) 

5 [245] The mouse ganglia were on average of the following dimensions: 1 .5mm X 2mm 

X 3mm. As described above, a total of 42 sections (20 thick) were produced approximately 
from each ganglion, which were sequentially collected onto 3 glass slides, whereby each slide 
contained representative ganglion sections 60jnm apart of each other. Consistently, 4 sections 
were identified containing X-gal (blue) cells on each glass slide, with an average of 93 (+/- 7.64 

10 S.D.) blue cells per section. Therefore, we infer that there were approximately 93 cells X 4 
sections X 3 glass slides =1,116 transduced neurons in each right-sided ganglion in the FIV- 
injected animals. No X-gal positive cells were identified in the saline injected animals. These 
results suggest that from a total of 5xl0 6 infectious particles injected into the articular TMJ 
space approximately 10 3 nerve fibers were infected resulting in lacZ expression, presumambly 

1 5 following uptake of the lentiviral vector by peripheral nerve fibers and retrograde transport to the 
nucleus. 

[246] The results shown herein demonstrated that intra-articular injection of FIV(lacZ) 
resulted in successful gene transfer to articular TMJ surfaces as well as the joint meniscus. 
Interestingly, VSV-G does not require interaction between the viral envelope protein and a 

20 specific membrane receptor, but instead interacts with a phospholipid component of the cell 
membrane leading to membrane-fusion mediated entry. This characteristic confers broad host- 
cell range for VSV-G pseudotyped viruses (Burns JC, et al. (1993). Proc Natl Acad Sci USA 90: 
8033-8037; Cameiro FA, et al. (2002). J Virol 76: 3756-3764). Therefore, it is possible that FIV 
vectors demonstrate higher infectivity for TMJ tissues than previously described viral vectors 

25 (Kuboki T, et al. (1999). Arc Oral Biol 44: 701-709), as well as result in prolonged transgene 
expression secondary to stable transgene integration (Poeschla EM, et al. (1998). Nature Med 4: 
354-357): 

[247] The efficacy of VSV-G pseudotyped FIV vectors to transduce peripheral tissues 
(Kang Y, et al.(2002). J Virol 76: 9378-9388), as well as the brain (Bloemer U, et al. (1997). J 
30 Virol 71 : 6641-6649) and cerebellum (Alisky JM, et al. (2002) MolNeurosci 11: 2669-2673) 
has been previously demonstrated. The present observations of cells staining positively for X- 
gal in the trigeminal ganglion ipsilateral to the site of injection indicates that FIV virions were 
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taken up by peripheral nerve projections of trigeminal sensory neurons that lead to infection and 
expression of the reporter gene lacZ by these neurons. Therefore, VSV-G pseudotyped 
lentiviruses, such as the defective feline or human immunodeficiency virus, can serve as the 
platform for the transfer of antinociceptive genes to temporomandibular joint tissues as well as 
5 the neurons that innervate these structures. 

3. Example 3: 

a) Vector construction 

[248] The ViraPower™ Lentiviral Expression System that can create a replication 
incompetent HF/-1 -based lentivirus (Invitrogen, Carlsbad CA) was employed. This system can 
10 deliver and express NSE/Human-ji-opioid receptor in either dividing or non-dividing 

mammalian cells. First the pLenti6A^5-D-TOPO vector was reconstructed by insert PCR 
product which was generated base on the multiple cloning site of pIRES vector (Clontech Inc, 
Palo Alto CA) with the PCR primers: MCS-upper primer 5*- 

C ACCTAATACG ACTC ACTATAGG-3 ' SEQ ID NO. 41 and MCS-lower primer 5'- 
1 5 CATTAACCCTCACTAAAG-3 ' SEQ ID NO. 42. This 707 bp PCR product was purified and 
cloned into plenti6/V5-D-TOPO vector directional (CACC, 4 base pair with overhang sequence 
will anneal to the GTGG sequence in the pLenti6/V5-D-TOPO vector). Multiple cloning sites 
were used as the template for PCR amplification to insert Nhel site to the 5' end and a multiple 
en2yme digestion sequence followed by a NotI site to the 3' end of fragment . Then the CMV 
20 promoter of pLenti6/V 5-D-TOPO was removed with Clal and Spel restriction enzyme 

digestions, followed by isopropanol DNA purification. Both ends of the re-constructed vector 
were blunted with T4 DNA polymerase (Invitrogen, Carlsbad CA) and ligated with T4 DNA 
ligase (Invitrogen) according to manufacturer's instruction. 

[249] The NSE promoter was originally from pTR-NT3myc-NSE vector (Described in 
25 Peel AL. et al., Gene Therapy. 4(1): 16-24, 1 997). The NSE promoter sequence can be found in 
SEQ ID NO: 52.). The NSE fragment was cut out with BgU and Hindffl restriction enzyme 
digestions. The Bgll site of this fragment was blunted. Later on, NSE fragment (2050 bp) was 
ligated into HindTIT and blunted Xhol sites of pBluescript II KS+/- phagemid to form 
pBluescript II KS-NSE. 
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[250] Human-|*-opioid receptor (HUMOR) DNA was from pcDNA3-Human-ji-opioid 
receptor (Wang JB, et al., PNAS USA 90:10230-4) The sequence of this particular Human-|x- 
opioid receptor is found in SEQ ED NO:53. 

[25 1 ] This 1 .6 Kb fragment was cut out of the vector with EcoRV and Xbal, and 
5 inserted into EcoRV/Xbal sites of pBluescript n KS-NSE to form pBluescript H KS-NSE- 
HUMOR. This structure was digested with Kpnl and blunted, followed by the digestion of 
Xbal. The whole NSE-Human ja-opioid fragment was ligated into pIRES plasmid at Xbal and 
EcoRI (blunt) sites, and become pIRES- NSE-Human ^-opioid vector. To Insert the NSE- 
human ji-opioid receptor genes into the constructed pLenti6/V5-D-TOPO without CMV 
10 promoter, pIRES-NSE-human p.-opioid was digested with Nhel and Sail restriction enzymes 
and ligated into Nhel/Sall sites of pLenti6/V5-d-TOPO at 14°C overnight and pLenti6/V5-D- 
TOPO-NSE-HUMOR vector was constructed 

[252] In order to increase the efficacy of virus packaging, cPPT sequence was added to 
the front of NSE of the pLenti6A^5-D-TOPO-NSE-HUMOR. Plasmid pLPl (SEQ ID NO:49) 

1 5 , was used as template to PCR amplifying cPPT fragements with Clal and Nhel sites at both ends 
(upper primer 5'-atatcgatatcgctagcttttaaaagaaaaggggg-3' SEQ ID NO. 43 and lower primer 5'- 
taatcgatgctaagcaaaattttgaatttttgtaatttg-3' SEQ ID NO. 44). The PCR products were digested with 
Nhel, and resulting fragment was ligated into the Nhel site of pI^nti6/V5-D-TOPO-NSE- 
HUMOR to generate pLenti6A^5-D-TOPO-cPPT-NSE-H UMOR plasmid at 4°C overnight A 

20 schematic description of the vector is disclosed herein. 

[253] The pLenti6/v5-D-TOPO-cppt-NSE-HUMOR (SEQ ID NO:48) were transiently 
co-transfected along with the three packaging plasmids, pLPl(SEQ ID NO:49), pLP2 (SEQ ID 
NO:50),and pLP/VSVG(SEQ ID NO:51), into 293FT cells (Invitrogen) cultured in DEME 
(Invitrogen) plus 10% FBS (Gemini, Woodland CA). After 24 hours, the medium was replaced 
25 with fresh medium supplemented with non-essential amino acid (Invitrogen). Seventy hours 
post-transfection, the supernatant was collected and filtered through 0.45 Acrodisc 25 mm 
syringe filter (Pall Corporation, Gelman Laboratory). Aliquots of virus were frozen at -80°C 
until further use. Tittering of the virus was performed on NIH3T3 cells cultured in 6 well tissue 
plates and assessed at 3x10 3 colonies (bfu)/ml by blasticidin selection. 
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b) Infection of albino neurons cells with Lenti6/NSE-Humor vims 

[254] Neuro-2a cell was plated into 6 well plate and cultured in MEM (GIBCO/BRL) 
with 10% FBS. To infected Lenti6/NSE-HUMOR virus, 1ml viral solution (3x10 3 ip) with 6 \xg 
polybrene solution was added to N2<x cell culture. After overnight plating, the medium was 
5 changed to regular MEM with 10% FBS. The cells were harvested after 96 hours infection. 

c) RT Polymerase Chain Reaction (RT-PCR) 

[255] The total RNA of the N2a cells infected with Lenti6/NSE-HUMOR virus was 
extracted with Trizol reagent (Livitrogen) according to manufacture instruction. 5 jig total RNA 
was used to syntheses first strain DNA with Superscript™ First-Strand Synthesis System for 

10 RT-PCR (fcivitrogen). Analysis of the presence of HUMOR gene was done by PGR, employing 
the following primer sets: S'-GAATTACCTAATGGGAACATGG-S ' (SEQ ID NO:45) and 5 9 - 
GCAGACGATGAACACAGC-3 ' (SEQ ID NO:46) (T A = 56°C, total 30 cycles). G3PDH house 
keeping gene was used as quantity PCR control. Detection of genomic G3PDH DNA was 
performed with primers 5'-ACCACAGCAATCAC-3' (SEQ ID NO:47) and 5'- 

15 TCC ACC ACCCTGTTGCTGTA-3 ' (SEQ ID NO:40) (Ta=58°C, 30 cycles). The PCR products 
were analyzed by agarose gel (1% w/v) electrophoresis and imaged were captured by a KODAK 
image analysis system (Rochester NY). 
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EL Sequences 

1. SEQ ID NO:l Homo sapiens HUMOR, protein Genbank Accession No. 

2. SEQ ID NO:2 Homo sapiens HUMOR, cDNA Genbank Accession No 
3 • SEQ ID NO:3 Murine HUMOR, protein Genbank Accession No 

4. SEQ ID NO:4 Murine HUMOR, cDNA Genbank Accession No 

5. SEQ ID NO:5 : human kappa opioid receptor cDNA 

6. SEQ ID NO:6 human delta opioid receptor cDNA sequence 

7 . SEQ ID NO:7 JTV(Opioid receptor construct) 

8. SEQ ID NO:8 FIV(LacZ) (a construct can be used for nerve 
transduction) 

9. SEQ ID NO:9 HUMOR degenerate cDNA G to A change at position 94 

10. SEQ ID NO:10: HUMOR polypeptide conservative substitution of 
Val32 to 132 

11. SEQ ID NO:ll: Neuron specific enolase promoter 

12. SEQ ID NO:12 FIV backbone 

13. SEQ ID NO:13: Packaging vector 

14. SEQ ID NO:14 FIV-NSE-HUMOR -pA 

15. SEQ ID NO:15: Mu-opioid RECEPTOR Bovine ACCESSION 
NPJ776833 

16. SEQ ID NO:16: Bos taurus mu opioid receptor mRNA, complete cds. 
ACCESSION U89677 

17. SEQ ID NO:17:mu opioid receptor - mouse. 
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18. SEQ ID NO:18: Mus musculus mu opioid receptor cDNA, complete cds 
ACCESSION U19380 

19. SEQ ID NO:19: mu opioid receptor - rat ACCESSION 156504 

20. SEQ ID NO:20: Rat mu opioid receptor mRNA, complete cds. 

5 21. SEQ ID NO:21 : mu opioid receptor [Sus scrofa] porcine ACCESSION 

AAB53770 

22. SEQ H) NO:22: Sus scrofa porcine mu opioid receptor mRNA, complete 
cds ACCESSION AF521309 

23. SEQ DD NO:23: DELTA-opioid RECEPTOR ACCESSION AAA18789 

10 24. SEQ D3 NO:24: Human delta opioid receptor mRNA complete cds 

ACCESSION U07882 

25. SEQ ID NO:25: delta opioid receptor [Sus scrofa] ACCESSION 
AAB39694 

26. SEQ D3 NO:26: delta opioid receptor [Rattus norvegicus] ACCESSION 
15 AAA19939 

27. SEQ ID NO:27: delta-opioid receptor [Mus musculus] ACCESSION 
AAA37522 

28. SEQ ID NO:28: Mus musculus delta-opioid receptor mRNA, cpl cds 
AccNo. L06322 

20 29. SEQ ID NO: 29: Homo sapiens (human) kappa opioid receptor 

ACCESSION AAA63906 

30. SEQ ID NO: 30: Human kappa opioid receptor (hKOR) mRNA, 
complete cds Acc No. U17298 

31. SEQ ID NO: 31: kappa opioid receptor [Mus musculus] ACCESSION 
25 AAA39363 

32. SEQ ID NO: 32: Mouse kappa opioid receptor mRNA, complete cds 
ACCESSION L11065 

33. SEQ ID NO: 33: kappa opioid receptor [Rattus norvegicus] 
ACCESSION AAA41496 
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34. SEQ ID NO: 34: Rattus norvegicus mRNA for kappa opioid receptor, 
complete cds ACCESSION D16829 



— 80 



WO 2004/073646 



PCT/US2004/004914 



L References 

Alisky JM, Hughes SM, Sauter SL, Joly D, Dubensky Jr. TW, Staber, PD, et al. (2002) 
Transduction of murine cerebellar neurons with recombinant FIV and AAV5 vectors. Mol 
Neurosci 11: 2669-2673. 

5 Baum BJ, Kok M, Iran SD, Yamano S (2002). The impact of gene therapy on dentistry: a 
revisiting after six years. JADA 1 33 : 35-44. 

Bernick S (1962). The vascular and nerve supply to the temporomandibular joint of the rat. Oral 
Surg 15:488-492. 

Bloemer U, Naldini L, Kafii T, Trono D, Verma IM, Gage FH (1997). Highly efficient and 
10 sustained gene transfer in adult neurons with a lentivirus vector. J Virol 71 : 6641-6649. 

Burns JC, Fiedmann T, Driever W, Burrascanno M, Yee J-K (1993). Vesicular stomatitis virus 
G glycoprotein pseudotyped retroviral vectors: Concentration to very high titer and efficient 
gene transfer into mammalian and nomnammalian cells. Proc Natl AcadSci USA 90: 8033- 
8037. 

1 5 Capra NF (1 987). Localization and central projections of primary afferent neurons that innervate 
the temporomandibular joint in cats. Somatosensory Res 4: 201-213. 

Carneiro FA, Bianconi L, Weissmueller G, Stauffer F, Da Poian AT (2002). Membrane 
recognition by vesicular stomatitis virus involves enthalpy-driven protein-lipid interactions. J 
Virol 76: 3756-3764. 

20 Cox BM, Ginsburg M, Osman OH (1968). Acute tolerance to narcotic drugs in rats. Br J 
Pharmacol 245-256. 

Dreessen D, Halata Z, Strasmann T (1990). Sensory innervation of the temporomandibular joint 
in the mouse. Acta Anat 139:154-160. 

Frommer J, Monroe CW (1966). The morphology and distribution of nerve fibers and endings 
25 associated with the mandibular joint of the mouse. J Dent Res 45: 1762-1766. 

Kang Y, Stein CS, Heth PA, Sinn PL, Penisten AK, Staber PD, et al.(2002). In vivo gene 
transfer using a nonprimate lentiviral vector pseudotyped with ross river virus glycoproteins. J 
Virol 76: 9378-9388. 

Kido MA, Kiyoshima T, Kondo T, Ayasaka N, Moroi R, Terada Y, Tanaka T (1993). 
30 Distribution of substance P and CGRP-like irrrmunoreactive nerve fibers in the rat 
temporomandibular joint J Dent Res 72:592-598. 

Kido MA, Kondo T, Ayasaka N, Terada Y, Tanaka T (1991). The peripheral distribution of 
trigeminal nerve fibers in the rat temporomandibular joint studied by an anterograde axonal 
transport method with wheat germ agglutinin-horseradish peroxidase. Arch Oral Biol 
35 36:397-400. 

Klineberg I (1971). Structure and function of temporomandibular joint innervation. Ann Royal 
Coll Surg Engl 49:268-288 



— 81 — 



WO 2004/073646 PCTYUS2004/004914 



Kuboki T, Nakanishi T, Kanyama M, Sonoyama W, Fujisawa T, Kobayashi K, et al. (1999). 
Direct adenovirus-mediated gene delivery to the temporomandibular joint in guinea-pigs. Arc 
Oral Biol 44: 701-709. 

Kyrkanides S, Moore AH, Olschowka JA, Daeschner JC, Williams JP, Hansen JT, OBanion MK 
5 (2002). Cyclooxygenase-2 modulates brain inflammation-related gene expression in central 
nervous system radiation injery. Mol Brain Res 104: 159-169. 

Kyrkanides S, Tallents RH, Macher DJ, Olschowka JO, Stevens SY (2002). Temporomandibular 
joint nociception: Effects of capsaicin on substance P-like immunoreactivity in the rabbit brain 
stem. J Orofac Pain 16: 229-235. 

10 Martin WR, Eades CG (1961). Demonstration of tolerance and physical dependence in the dog 
following a short-term infusion of morphine. JPliarmacol Exp Ther 133: 262-270. 

Peel AL. Zolotukhin S. Schrimsher GW. Muzyczka N. Reier PJ. Efficient transduction of green 
fluorescent protein in spinal cord neurons using adeno-associated virus vectors containing cell 
type-specific promoters. [Journal Article] Gene Therapy. 4(l):16-24, 1997 

15 Poeschla EM, Wong-Stall F, Looney Dl (1998). Efficient transduction of nondividing human 
cells by feline immunodeficiency virus lentiviral vectors. Nature Med 4: 354-357. 

Pohl M, Braz J (2001). Gene therapy of pain: emerging strategies and future directions. Eur J 
Pharmacol 429: 39-48. 

Romfh JH, Capra NF, Gatipon GB (1979). Trigeminal nerve and temporomandibular joint of the 
20 cat: A horseradish peroxidase study. Exp Neurol 65: 99-106. 

Sessle BJ, Hu JW (1991). Mechanisms of pain arising from articular tissues. Can J Physiol 
Pharmacol 69: 617-626. 

Thilander B (1964). Innervation of the temporomandibular disc in Man. Acta Odont Scan 
22:151-156. 

25 Wang JB, Imai Y, Eppler CM, Gregor P, Spivak CE, Uhl GR. (1993). mu opiate receptor: cDNA 
cloning and expression. PNAS USA 90: 10230-4 

Wink CS, St ! Onge M, Zimny ML (1992). Neural elements in the human temporomandibular 
articular disc. J Oral Maxillofac Surg 50:334-337 

Wu CL, Gany MG, Zollo RA, Yang J (2001). Gene therapy for the management of pain: part II: 
30 molecular targets. Anesthesiology 95 : 2 1 6-240. 

Yoshino K, Kawagishi S, Amano N (1998). Morphological characteristics of primary sensory 
and post-synaptic sympathetic neurons supplying the temporomandibular joint in the cat Arc 
Oral Biol; 43: 679-686. 



— 82 — 



WO 2004/073646 



PCT/US2004/004914 



V. CLAIMS 

What is claimed is: 

1 . A vector for delivering an opioid receptor to a nerve cell, comprising sequence 
encoding an opioid receptor and a vector backbone. 

2. The vector of claim 1, wherein the opioid receptor is a ji-opioid receptor. 

3. The vector of claim 2, wherein the |i-opioid receptor has a sequence with at least 80% 
identity to the sequence set forth in SEQ ID NO: 1 . 

4. The vector of claim 2, wherein the ^-opioid receptor has a sequence with at least 85% 
identity to the sequence set forth in SEQ ID NO: 1 . 

5. The vector of claim 2, wherein the ^l-opioid receptor has a sequence with at least 90% 
identity to the sequence set forth in SEQ ID NO: 1 . 

6. The vector of claim 2, wherein the |i-opioid receptor has a sequence with at least 95% 
identity to the sequence set forth in SEQ ID NO: 1 . 

7. The vector of claims 1 -6, wherein the composition further comprises a promoter. 

8. The vector of claim 7, wherein the promoter is a nerve cell specific promoter. 

9. The vector of claim 8, wherein the promoter is the neuron specific enolase promoter. 
The vector of claim 7, wherein the promoter is a NSE promoter. 

10. The vector of claim 7, wherein the vector backbone is a lentiviral vector backbone. 

I 

1 1 . The vector of claim 1 0, wherein the lentiviral vector backbone is a feline 
immunodeficiency vector (FIV). 

12. The vector of claim 1 1, wherein the FIV has a sequence with at least 80% identity to 
the sequence set forth in SEQ ID NO:7. 

13. A cell comprising the vector of claims 1-12. 

14. An animal comprising the cell of claim 13. 

1 5. A cell comprising the integrated product of the vector of claims 1-12. 

1 6. An animal comprising the cell of claim 1 5. 

17. A method of reducing pain in a subject, comprising administering the vector of 
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claims 1-12, to the subject, wherein the vector transduces a nerve cell. 

1-8. The method of claim 17, wherein administering the vector occurs at the point of 
pain. 

19. The method of claim 17, wherein administering the vector occurs at the distal end of 
the nerve cell. 

20. The method of claim 17, wherein administering the vector occurs in the peripheral 
nervous system. 

2 1 . The method of claim 1 7, wherein administering the vector occurs at the axon or axon 
terminal of the nerve cell. 

22. The method of claim 1 7, wherein administering the vector occurs at the dendrite of 
the nerve cell. 

23. The method of claim 17, wherein administering the vector occurs at the trigeminal 
ganglion. 

24. The vector of claim 1, wherein the vector comprises the sequence set forth in SEQ 
IDNO:48. 

25. The vector of claim 1 , wherein the vector comprises the sequence set forth in SEQ 
ID NO: 7. 

26. A method of producing the vector of claim 1, comprising linking the opiod receptor 
sequence operably to a promoter. 

27. A method of producing the cell of claim 13, comprising transfecting the vector of 
claim 1 into the cell. 

28. An animal produced by the process of administering the vector of claim 1 to the 
animal. 

29. The vector of claim 7, wherein the vector backbone is an HIV vector backbone. 
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SEQUENCE LISTING 

<110> University of Rochester 

Kyrkanides , Stephanos 
Tallents, Ross H. 

<120> Treatment of Pain Through Expression of 
Opioid Receptors 

<130> 21108. 0022P1 

<150> 60/448,663 
<151> 2003-02-19 

<160> 54 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 400 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note « 
synthetic construct 

<400> 1 



Met 


Asp 


Ser 


Ser 


Ala 


Ala 


Pro Thr 


Asn 


Ala 


Ser 


Asn 


Cys 


Thr Asp 


Ala 


1 








5 








10 










15 




Leu 


Ala 


Tyr 


Ser 


Ser 


Cys 


Ser Pro 


Ala 


Pro 


Ser 


Pro 


Gly 


Ser 


Trp 


Val 








20 








25 










30 






Asn 


Leu 


Ser 


His 


Leu 


Asp 


Gly Asn 


Leu 


Ser 


Asp 


Pro 


Cys 


Gly Pro 


Asn 






35 








40 










45 








Arg 


Thr 


Asp 


Leu 


Gly 


Gly 


Arg Asp 


Ser 


Leu 


Cys 


Pro 


Pro 


Thr Gly 


Ser 




50 










55 








60 










Pro 


Ser 


Met 


He 


Thr 


Ala 


He Thr 


He 


Met 


Ala 


Leu 


Tyr 


Ser 


He 


Val 


65 










70 








75 










80 


Cys 


Val 


Val 


Gly 


Leu 


Phe 


Gly Asn 


Phe 


Leu 


Val 


Met 


Tyr 


Val 


He 


Val 










85 








90 










95 




Arg 


Tyr 


Thr 


Lys 


Met 


Lys 


Thr Ala 


Thr 


Asn 


He 


Tyr 


He 


Phe 


Asn 


Leu 








100 








105 










110 






Ala 


Leu 


Ala 


Asp 


Ala 


Leu 


Ala Thr 


Ser 


Thr 


Leu 


Pro 


Phe 


Gin 


Ser 


Val 






115 








120 










125 








Asn 


Tyr 


Leu 


Met 


Gly 


Thr 


Trp Pro 


Phe 


Gly 


Thr 


He 


Leu 


Cys 


Lys 


He 




130 










135 








140 










Val 


He 


Ser 


He 


Asp 


Tyr 


Tyr Asn 


Met 


Phe 


Thr 


Ser 


He 


Phe 


Thr 


Leu 


145 










150 








155 










160 


Cys 


Thr 


Met 


Ser 


Val 


Asp 


Arg Tyr 


He 


Ala 


Val 


Cys 


His 


Pro 


Val 


Lys 










165 








170 










175 




Ala 


Leu 


Asp 


Phe 


Arg 


Thr 


Pro Arg 


Asn 


Ala 


Lys 


He 


He 


Asn 


Val 


Cys 








180 








185 










190 






Asn 


Trp 


He 


Leu 


Ser 


Ser Ala He 


Gly 


Leu 


Pro 


Val 


Met 


Phe 


Met 


Ala 






195 








200 










205 








Thr 


Thr 


Lys 


Tyr 


Arg 


Gin Gly Ser 


He 


Asp 


Cys 


Thr 


Leu 


Thr Phe 


Ser 




210 










215 








220 










His 


Pro 


Thr 


Trp 


Tyr 


Trp Glu Asn 


Leu 


Leu 


Lys 


He 


Cys 


Val 


Phe 


He 


225 










230 








235 










240 
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Phe Ala 


Phe 


He Met Pro 
245 


Val 


Leu 


He 


He 
250 


Thr 


Val 


Cys 


Tyr 


Gly 
255 


Leu 


Met He 


Leu 


Arg Leu Lys 


Ser 


Val Arg 


Met 


Leu 


Ser Gly 


Ser 


Lys 


Glu 






260 






265 










270 






Lys Asp 


Arg 


Asn Leu Arg 


Arg 


He 


Thr 


Arg Met 


Val 


Leu 


Val 


Val 


Val 




275 






280 










285 








Ala Val 


Phe 


He Val Cys 


Trp 


Thr 


Pro 


He 


His 


He 


Tyr 

-* 


Val 


He 


He 


290 






295 










300 










Lys Ala 


Leu 


Val Thr He 


Pro 


Glu 


Thr 


Thr 


Phe 


Gin 


Thr 


Val 


Ser 


Trp 


305 




310 










315 










320 


His Phe 


Cys 


He Ala Leu 


Gly 


Tyr Thr 


Asn 


Ser 


Cys 


Leu 


Asn 


Pro 


Val 






325 








330 










335 




Leu Tyr 


Ala 


Phe Leu Asp 


Glu 


Asn 


Phe 


Lys Arg 


Cys 


Phe 


Arg 


Glu 


Phe 






340 






345 










350 






Cys He 


Pro 
355 


Thr Ser Ser 


Asn 


He 
360 


Glu 


Gin 


Gin 


Asn 


Ser 
365 


Thr 


Arg 


He 


Arg Gin 


Asn 


Thr Arg Asp 


His 


Pro 


Ser 


Thr 


Ala 


Asn 


Thr 


Val 


Asp 


Arg 


370 






375 










380 










Thr Asn 


His 


Gin Leu Glu 


Asn 


Leu 


Glu 


Ala 


Glu 


Thr Ala 


Pro 


Leu 


Pro 


385 




390 










395 










400 



<210> 2 
<211> 1200 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

*400> 2 

atggacagca gcgctgcccc cacgaacgcc agcaattgca ctgatgcctt 
agttgctccc cagcacccag ccccggttcc tgggtcaact tgtcccactt 
ctgtccgacc catgcggtcc gaaccgcacc gacctgggcg ggagagacag 
ccgaccggca gtccctccat gatcacggcc atcacgatca tggccctcta 
tgcgtggtgg ggctcttcgg aaacttcctg gtcatgtatg tgattgtcag 
atgaagactg ccaccaacat ctacattttc aaccttgctc tggcagatgc 
agtaccctgc ccttccagag tgtgaattac ctaatgggaa catggccatt 
ctttgcaaga tagtgatctc catagattac tataacatgt tcaccagcat 
tgcaccatga gtgttgatcg atacattgca gtctgccacc ctgtcaaggc 
cgtactcccc gaaatgccaa aattatcaat gtctgcaact ggatcctctc 
ggtcttcctg taatgttcat ggctacaaca aaatacaggc aaggttccat 
ctaacattct ctcatccaac ctggtactgg gaaaacctgc tgaagatctg 
ttcgccttca ttatgccagt gctcatcatt accgtgtgct atggactgat 
ctcaagagtg tccgcatgct ctctggctcc aaagaaaagg acaggaatct 
accaggatgg tgctggtggt ggtggctgtg ttcatcgtct gctggactcc 
tacgtcatca ttaaagcctt ggttacaatc ccagaaacta cgttccagac 
cacttctgca ttgctctagg ttacacaaac agctgcctca acccagtcct 
ctggatgaaa acttcaaacg atgcttcaga gagttctgta tcccaacctc 
gagcaacaaa actccactcg aattcgtcag aacactagag accacccctc 
acagtggata gaactaatca tcagctagaa aatctggaag cagaaactgc 



<210> 3 
<211> 398 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note « 



ggcgtactca 60 
agatggcaac 120 
cctgtgccct 180 
ctccatcgtg 240 
atacaccaag 300 
cttagccacc 360 
tggaaccatc 420 
attcaccctc 480 
cttagatttc 540 
ttcagccatt 600 
agattgtaca 660 
tgttttcatc 720 
gatcttgcgc 780 
tcgaaggatc 840 
cattcacatt 900 
tgtttcttgg 960 
ttatgcattt 1020 
ttccaacatt 1080 
cacggccaat 1140 
tccgttgccc 1200 
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synthetic construct 



<400> 3 




















Met Asp Ser 


Ser Ala Gly 


Pro Gly Asn 


He 


Ser 


Asp 


Cys 


Ser 


Asp 


Pro 


1 


5 




10 










15 




Leu Ala Pro 


Ala Ser Trp 


Ser Pro Ala 


Pro 


Gly 


Ser 


Trp 


Leu 


Asn 


Leu 




20 


25 










30 






Ser His Val 


Asp Gly Asn 


Gin Ser Asp 


Pro 


Cys 


Gly 


Pro 


Asn 


Arg 


Thr 


35 




40 








45 








Gly Leu Gly 


Gly Ser His 


Ser Leu Cys 


Pro 


Gin 


Thr 


Gly 


Ser 


Pro 


Ser 


50 




55 






60 










Met Val Thr 


Ala He Thr 


He Met Ala 


Leu 


Tyr 


Ser 


He 


Val 


Cys 


Val 


65 


70 






75 










80 


Val Gly Leu 


Phe Gly Asn 


Phe Leu Val 


Met 


Tyr 


Val 


He 


Val 


Arg 


Tyr 




85 




90 










95 




Thr Lys Met 


Lys Thr Ala 


Thr Asn He 


Tyr 


He 


Phe 


Asn 


Leu 


Ala 


Leu 




100 


105 










110 






Ala Asp Ala 


Leu Ala Thr 


Ser Thr Leu 


Pro 


Phe 


Gin 


Ser 


Val 


Asn 


Tyr 


115 




120 








125 








Leu Met Gly 


Thr Trp Pro 


Phe Gly Asn 


He 


Leu 


Cys 


Lys 


He 


Val 


He 


130 




135 






140 










Ser lie Asp 


Tyr Tyr Asn 


Met Phe Thr 


Ser 


He 


Phe 


Thr 


Leu 


Cys 


Thr 


145 


150 






155 










160 


Met Ser Val 


Asp Arg Tyr 


He Ala Val 


Cys 


His 


Pro 


Val 


Lys 


Ala 


Leu 




165 




170 










175 




Asp Phe Arg 


Thr Pro Arg 


Asn Ala Lys 


He 


Val 


Asn 


Val 


Cys 


Asn 


Trp 




180 


185 










190 






lie Leu Ser 


Ser Ala He 


Gly Leu Pro 


Val 


Met 


Phe 


Met 


Ala 


Thr 


Thr 


195 




200 








205 








Lys Tyr Arg 


Gin Gly Ser 


He Asp Cys 


Thr 


Leu 


Thr 


Phe 


Ser 


His 


Pro 


210 




215 






220 










Thr Trp Tyr 


Trp Glu Asn 


Leu Leu Lys 


lie 


Cys 


Val 


Phe 


He 


Phe 


Ala 


225 


230 






235 










240 


Phe He Met 


Pro Val Leu 


He He Thr 


Val 


Cys 


Tyr 


Gly 


Leu 


Met 


He 




245 




250 










255 




Leu Arg Leu 


Lys Ser Val 


Arg Met Leu 


Ser 


Gly 


Ser 


Lys 


Glu 


Lys 


Asp 




260 


265 










270 






Arg Asn Leu 


Arg Arg He 


Thr Arg Met 


Val 


Leu 


Val 


Val 


Val 


Ala 


Val 


275 




280 








285 








Phe He Val 


Cys Trp Thr 


Pro He His 


He 


Tyr 


Val 


He 


He 


Lys 


Ala 


290 




295 






300 










Leu He Thr 


He Pro Glu 


Thr Thr Phe 


Gin 


Thr 


Val 


Ser 


Trp 


His 


Phe 


305 


310 






315 










320 


Cys He Ala 


Leu Gly Tyr 


Thr Asn Ser 


Cys 


Leu 


Asn 


Pro 


Val 


Leu 


Tyr 




325 




330 










335 




Ala Phe Leu 


Asp Glu Asn 


Phe Lys Arg 


Cys 


Phe 


Arg 


Glu 


Phe 


Cys 


He 




340 


345 










350 






Pro Thr Ser 


Ser Thr He 


Glu Gin Gin 


Asn 


Ser 


Ala 


Arg 


He 


Arg 


Gin 


355 




360 








365 








Asn Thr Arg 


Glu His Pro 


Ser Thr Ala 


Asn 


Thr 


Val 


Asp 


Arg 


Thr 


Asn 


370 




375 






380 










His Gin Leu 


Glu Asn Leu 


Glu Ala Glu 


Thr 


Ala 


Pro 


Leu 


Pro 






385 


390 






395 













<210> 4 
<211> 1194 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 4 

atggacagca gcgccggccc agggaacatc agcgactgct ctgacccctt agctcctgca 60 

agttggtccc cagcacctgg ctcctggctc aacttgtccc acgttgatgg caaccagtcc 120 

gacccatgcg gtcctaaccg cacggggctt ggcgggagcc acagcctgtg ccctcagacc 180 

ggcagccctt ccatggtcac agccatcacc atcatggccc tctattctat cgtgtgtgta 240 

gtgggcctct ttggaaactt cctggtcatg tatgtgattg taagatatac caaaatgaag 300 

actgccacca acatctacat tttcaacctt gctctggcag atgccttagc cactagcacg 360 

ctgccctttc agagtgttaa ctacctgatg ggaacgtggc cctttggaaa catcctctgc 420 

aagatcgtga tctcaataga ctactacaac atgttcacca gtatcttcac cctctgcacc 480 

atgagtgtag accgctacat tgccgtctgc cacccggtca aggccctgga tttccgtacc 540 

ccccgaaatg ccaaaattgt caatgtctgc aactggatcc tctcttctgc cattggtctg 600 

cccgtaatgt tcatggcaac cacaaaatac aggcaggggt ccatagattg caccctcact 660 

ttctctcatc ccacatggta ctgggagaac ctgctcaaaa tctgtgtctt catcttcgcc 720 

ttcatcatgc cggtcctcat catcactgtg tgttatggac tgatgatctt acgactcaag 780 

agtgtccgca tgctgtcggg ctccaaagaa aaggacagga acctgcgcag gatcacccgg 840 

atggtgctgg tggtcgtggc tgtatttatt gtctgctgga cccccatcca catctatgtc 900 

atcatcaaag cactgatcac gattccagaa accactttcc agactgtttc ctggcacttc 960 

tgcattgcct tgggttacac aaacagctgc ctgaacccag ttctttatgc gttcctggat 1020 

gaaaacttca aacgatgttt tagagagttc tgcatcccaa cttcctccac aatcgaacag 1080 

caaaactctg ctcgaatccg tcaaaacact agggaacacc cctccacggc taatacagtg 1140 

gatcgaacta accaccagct agaaaatctg gaagcagaaa ctgctccatt gccc 1194 

<210> 5 

<211> 2423 " . .. 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 5 

acccgcgctc gtacgtgcgc ctccgccggc agctcctgac tcatcggggg ctccgggtca 60 

catgcgcccg cgcggcccta taggcgcctc ctccgcccgc cgcccgggag ccgcagccgc 120 

cgccgccact gccactcccg ctctctcagc gccgccgtcg ccaccgccac cgccaccgcc 180 

actaccaccg tctgagtctg cagtcccgag atcccagcca tcatgtccat agagaagatc 240 

tgggcccggg agatcctgga ctcccgcggg aaccccacag tggaggtgga tctctatact 300 

gccaaaggtc ttttccgggc tgcagtgccc agtggagcct ctacgggcat ctatgaggcc 360 

ctggagctga gggatggaga caaacagcgt tacttaggca aaggtgtcct gaaggcagtg 420 

gaccacatca actccaccat cgcgccagcc ctcatcagct caggtctctc tgtggtggag 480 

caagagaaac tggacaacct gatgctggag ttggatggga ctgagaacaa atccaagttt 540 

ggggccaatg ccatcctggg tgtgtctctg gccgtgtgta aggcaggggc agctgagcgg 600 

gaactgcccc tgtatcgcca cattgctcag ctggccggga actcagacct catcctgcct 660 

gtgccggcct tcaacgtgat caatggtggc tctcatgctg gcaacaagct ggccatgcag 720 

gagttcatga tcctcccagt gggagctgag agctttcggg atgccatgcg actaggtgca 780 

gaggtctacc atacactcaa gggagtcatc aaggacaaat acggcaagga tgccaccaat 840 

gtgggggatg aaggtggctt tgcccccaat atcctggaga acagtgaagc cttggagctg 900 

gtgaaggaag ccatcgacaa ggctggctac acggaaaaga tcgttattgg catggatgtt 960 

gctgcctcag agttttatcg tgatggcaaa tatgacttgg acttcaagtc tcccactgat 1020 

ccttcccgat acatcactgg ggaccagctg ggggcactct accaggactt tgtcagggac 1080 

tatcctgtgg tctccattga ggacccattt gaccaggatg attgggctgc ctggtccaag 1140 

ttcacagcca atgtagggat ccagattgtg ggtgatgacc tgacagtgac caacccaaaa 1200 

cgtattgagc gggcagtgga agaaaaggcc tgcaactgtc tgctgctcaa ggtcaaccag 1260 

atcggctcgg tcactgaagc catccaagcg tgcaagctgg cccaggagaa tggctggggg 1320 

gtcatggtga gtcatcgctc aggagagact gaggacacat tcattgctga cctggtggtg 1380 

gggctgtgca caggccagat caagactggt gccccgtgcc gttctgaacg tctggctaaa 1440 

tacaaccagc tcatgagaat tgaggaagag ctgggggatg aagctcgctt tgccggacat 1500 

aacttccgta atcccagtgt gctgtgattc ctctgcttgc ctggagacgt ggaacctctg 1560 
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tctcatcctc ctggaacctt gctgtcctga tctgtgatag ttcaccccct gagatcccct 1620 
gagccccagg gtgcccagaa cttccctgat tgacctgctc cgctgctcct tggcttacct 1680 
gacctcttgc tgtctctgct cgccctcctt tctgtgccct actcattggg gttccgcact 1740 
ttccacttct tcctttctct ttctctcttc cctcagaaac tagaaatgtg aatgaggatt 1800 
attataaaag ggggtccgtg gaagaatgat cagcatctgt. gatgggagcg tcagggttgg 1860 
tgtgctgagg tgttagagag ggaccatgtg tcacttgtgc tttgctcttg tcccacgtgt 1920 
cttccacttt gcatatgagc cgtgaactgt gcatagtgct gggatggagg ggagtgttgg 1980 
gcatgtgatc acgcctggct aataaggctt tagtgtattt atttatttat ttattttatt 2 040 
tgtttttcat tcatcccatt aatcatttcc ccataactca atggcctaaa actggcctga 2100 
cttgggggaa cgatgtgtct gtatttcatg tggctgtaga tcccaagatg actggggtgg 2160 
gaggtcttgc tagaatggga agggtcatag aaagggcctt gacatcagtt cctttgtgtg 2220 
tactcactga agcctgcgtt ggtccagagc ggaggctgtg tgcctggggg agttttcctc 2280 
tatacatctc tccccaaccc taggttccct gttcttcctc cagctgcacc agagcaacct 2340 
ctcactcccc atgccacgtt ccacagttgc caccacctct gtggcattga aatgagcacc 2400 
tccattaaag tctgaatcag tgc 2423 

<210> 6 
<211> 1773 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence ; /note = 

synthetic construct ■ , 

<400> 6 

cdgaggagcc tgcgctgdtc ctggctcaca gcgctccggg cgaggagagc gggcggaccg 60 
gggggctggg ccggtgcggg cggcgaggca ggcggacgag gcgcagagac agcggggcgg 120 
ccggggcgcg gcacgcggcg ggtcggggcc ggcctctgcc ttgccgctcc cctcgcgtcg 180 
gatccccgcg cccaggcagc cggtggagag ggacgcggcg gacgccggca gccatggaac 240 
cggccccctc cgccggcgcc gagctgcagc ccccgctctt cgccaacgcc tcggacgcct 300 
accctagcgc cttccccagc gctggcgcca atgcgtcggg gccgccaggc gcgcggagcg 360 
cctcgtccct cgccctggca atcgccatca ccgcgctcta ctcggccgtg tgcgccgtgg 420 
ggctgctggg caacgtgctt gtcatgttcg gcatcgtccg gtacactaag atgaagacgg 480 
ccaccaacat ctacatcttc aacctggcct tagccgatgc gctggccacc agcacgctgc 540 
ctttccagag tgccaagtac ctgatggaga cgtggccctt cggcgagctg ctctgcaagg 600 / 
ctgtgctctc catcgactac tacaatatgt tcaccagcat cttcacgctc accatgatga 660 
gtgttgaccg ctacatcgct gtctgccacc ctgtcaaggc cctggacttc cgcacgcctg 720 
ccaaggccaa gctgatcaac atctgtatct gggtcctggc ctcaggcgtt ggcgtgccca 780 
tcatggtcat ggctgtgacc cgtccccggg acggggcagt ggtgtgcatg ctccagttcc 840 
ccagccccag ctggtactgg gacacggtga ccaagatctg cgtgttcctc ttcgccttcg 900 
tggtgcccat cctcatcatc accgtgtgct atggcctcat gctgctgcgc ctgcgcagtg 960 
tgcgcctgct gtcgggctcc aaggagaagg accgcagcct gcggcgcatc acgcgcatgg 1020 
tgctggtggt tgtgggcgcc ttcgtggtgt gttgggcgcc catccacatc ttcgtcatcg 1080 
tctggacgct ggtggacatc gaccggcgcg acccgctggt ggtggctgcg ctgcacctgt 1140 
gcatcgcgct gggctacgcc aatagcagcc tcaaccccgt gctctacgct ttcctcgacg 1200 
agaacttcaa gcgctgcttc cgccagctct gccgcaagcc ctgcggccgc ccagacccca 1260 
gcagcttcag ccgcgcccgc gaagccacgg cccgcgagcg tgtcaccgcc tgcaccccgt 1320 
ccgatggtcc cggcggtggc gctgccgcct gaccaggcca tccggccccc agacgcccct 1380 
ccctagttgt acccggaggc cacatgagtc ccagtgggag gcgcgagcca tgatgtggag 1440 
tggggccagt agataggtcg gagggctttg ggaccgccag atggggcctc tgtttcggag 1500 
acgggaccgg gccgctagat gggcatgggg tgggcctctg gtttggggcg aggcagagga 1560 
cagatcaatg gcgcagtgcc tctggtctgg gtgcccccgt ccacggctct aggtggggcg 1620 
ggaaagccag tgactccagg agaggagcgg gacctgtggc tctacaactg agtccttaaa 1680 
cagggcatct ccaggaaggc ggggcttcaa ccttgagaca gcttcggttt ctaacttgga 1740 
gccggacttt cggagttggg gggtccgggg ccc 1773 

<210> 7 
<211> 9426 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 7 

atgcggtttt ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca 60 
agtctccacc ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt 120 
ccaaaatgtc gtaacaactc cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg 180 
gaggtctata taagcagagc tctgtgaaac ttcgaggagt ctctttgttg aggacttttg 240 
agttctccct tgaggctccc acagatacaa taaatatttg agattgaacc ctgtcgagta 300 
tctgtgtaat cttttttacc tgtgaggtct cggaatccgg gccgagaact tcgcagttgg 360 
cgcccgaaca gggacttgat tgagagtgat tgaggaagtg aagctagagc aatagaaagc 420 
tgttaagcag aactcctgct gacctaaata gggaagcagt agcagacgct gctaacagtg^ 480 
agtatctcta gtgaagcgga ctcgagctca taatcaagtc attgtttaaa ggcccagata 540 
aattacatct ggtgactctt cgcggacctt caagccagga gattcgccga gggacagtca 600 
acaaggtagg agagattcta cagcaacatg gggaatggac aggggcgaga ttggaaaatg 660 
gccattaaga gatgtagtaa tgttgctgta ggagtagggg ggaagagtaa aaaatttgga 720 
gaagggaatt tcagatgggc cattagaatg gctaatgtat ctacaggacg agaacctggt 780 
gatataccag agactttaga tcaactaagg ttggttattt gcgatttaca agaaagaaga 840 
gaaaaatttg gatctagcaa agaaattgat atggcaattg tgacattaaa agtctttgcg 900 
gtagcaggac ttttaaatat gacgggtgtc tactgctgct gcagctgaaa atatgtattc 960 
tcaaatggga ttagacacta ggccatctat gaaagaagca ggtggaaaag aggaaggccc 1020 
tccacaggca tatcctattc aaacagtaaa tggagtacca caatatgtag cacttgaccc 1080 
aaaaatggtg tccattttta tggaaaaggc aagagaagga ctaggaggtg aggaagttca 1140 
actatggttt actgccttct ctgcaaattt aacacctact gacatggcca cattaataat 1200 
ggccgcacca gggtgcgctg cagataaaga aatattggat gaaagcttaa agcaactgac 1260 
agcagaatat gatcgcacac atccccctga tgctcccaga ccattaccct attttactgc 1320 
agcagaaatt atgggtatag gattaactca agaacaacaa gcagaagcaa gatttgcacc 1380 
agctaggatg cagtgtagag catggtatct cgaggcatta ggaaaattgg ctgccataaa 1440 
agctaagtct cctcgagctg tgcagttaag acaaggagct aaggaagatt attcatcctt 1500 
tatagacaga ttgtttgccc aaatagatca agaacaaaat acagctgaag ttaagttata 1560 
tttaaaacag tcattgagca tagctaatgc taatgcagac tgtaaaaagg caatgagcca 1620 
ccttaagcca gaaagtaccc tagaagaaaa gttgagagct tgtcaagaaa taggctcacc 1680 
aggatataaa atgcaactct tggcagaagc tcttacaaaa gttcaagtag tgcaatcaaa 1740 
aggatcagga ccagtgtgtt ttaattgtaa aaaaccagga catctagcaa gacaatgtag 1800 
agaagtgaaa aaatgtaata aatgtggaaa acctggtcat gtagctgcca aatgttggca 1860 
aggaaataga aagaattgta caagggaaga aagggataca acaattacaa aagtgggaag 1920 
attgggtagg atggatagga aatattccac aatatttaaa gggactattg ggaggtatct 1980 
tgggaatagg attaggagtg ttattattga ttttatgttt acctacattg gttgattgta 2 040 

:taagaaattg tatccacaag atactaggat acacagtaat tgcaatgcct gaagtagaag 2100 
gagaagaaat acaaccacaa atggaattga ggagaaatgg taggcaatgt ggcatgtctg 2160 
aaaaagagga ggaatgatga agtatctcag acttatttta taagggagat actgtgctga 2220 
gttcttccct ttgaggaagg tatgtcatat gaatccattt cgaaatcaaa ttcctcctct 2280 
gctcgcccaa tccttccaac cccctatggt ggtatggctg acacagaaaa tgtctgctcc 2340 

. tgtatgggac atttgcccct cttctccaaa tataagacag gatgaggcct agcttttgct 2400 
gctccaaagt tttaaaagaa cacattgcac ggcatttagg gactctaaag ggtggaggag 2460 
gaatgaggga attgcatcat gccaaggctg gtcctcatcc atcactgctt ccagggccca 2520 
gagtggcttc caggaggtat tcttacaaag gaagcccgat ctgtagctaa cactcagagc 2580 
ccattttcct gcgttaaccc ctcccgacct catatacagg agtaacatga tcagtgacct 2640 
gggggagctg gccaaactgc gggacctgcc caagctgagg gccttggtgc tgctggacaa 2700 
cccctgtgcc gatgagactg actaccgcca ggaggccctg gtgcagatgg cacacctaga 2760 
gcgcctagac aaagagtact atgaggacga ggaccgggca gaagctgagg agatccgaca 2820 
gaggctgaag gaggaacagg agcaagaact cgacccggac caagacatgg aaccgtacct 2880 
cccgccaact tagtggctcc tctagcctgc agggacagta aaggtgatgg caggaaggca 2940 
gcccccggag gtcaaaggct gggcacgcgg gaggagaggc cagagtcaga ggctgcgggt 3000 
atctcagata tgaaggaaag atgagagagg ctcaggaaga ggtaagaaaa gacacaagag 3060 
accagagaag ggagaagaat tagagaggga ggcagaggac cgctgtctct acagacatag 3120 
ctggtagaga^ctgggaggaa gggatgaacc ctgagcgcat gaagggaagg aggtggctgg 3180 
tggtatatgg aggatgtagc tgggccaggg aaaagatcct gcactaaaaa tctgaagcta 3240 
aaaataacag gacacggggt ggagaggcga aaggagggca gattgaggca gagagactga 3300 



6/53 



WO 2004/073646 



PCT/US2004/004914 



gaggcctggg gatgtgggca ttccggtagg 
aggaggccar agatgctgac ctcaagaact 
atagccctgt tacaagaagt gggagatgtt 
atcccgaggc tcaggttctg tggtggtcat 
aaagtcctaa gcacagctct caagcagatc 
ctggagagaa gagccgagga ggtgggctgt 
gcgtttttat attctgtgtc gagttcgtga 
ggttagtacc tgccactata ctggaataag 
caaggttatg agcatccgtg tacttatggg 
ccttccccca cccgttcggt tccccaccac 
cagctcttga ctcatcgggg cccccgggtc 
ccccctgccc accccccgcc cgcgctggga 
gcgccgccgc cgtcaccacc gccaccgcca 
aaaaaccaga aagttaactg gtaagtttag 
ccggtggtgg tgcaaatcaa agaactgctc 
tgtacggaag tgttacttct gctctaaaag 
cttgatatcg aattccggat gagcctctgt 
agaggagaat gtcagatgct cagctcggtc 
ccaggactgg tttctgtaag aaacagcagg 
aggcgcttgg aacccgaaaa gtctcggtgc 
cggccgtcag taccatggac agcagcgctg 
ccttggcgta ctcaagttgc tccccagcac 
acttagatgg caacctgtcc gacccatgcg 
acagcctgtg ccctccgacc ggcagtccct 
tctactccat cgtgtgcgtg gtggggctct 
tcagatacac caagatgaag actgccacca 
atgccttagc caccagtacc ctgcccttcc 
catttggaac catcctttgc aagatagtga 
gcatattcac cctctgcacc atgagtgttg 
aggccttaga tttccgtact ccccgaaatg 
tctcttcagc cattggtctt cctgtaatgt 
ccatagattg tacactaaca ttctctcatc 
tctgtgtttt catcttcgcc ttcattatgc 
tgatgatctt gcgcctcaag agtgtccgca 
atcttcgaag gatcaccagg atggtgctgg 
ctcccattca catttacgtc atcattaaag 
agactgtttc ttggcacttc tgcattgctc 
tcctttatgc atttctggat gaaaacttca 
cctcttccaa cattgagcaa caaaactcca 
ccctccacgg ccaatacagt ggatagaact 
actgctccgt tgccctagag ctcgctgatc 
atctgttgtt tgcccctccc ccgtgccttc 
cctttcctaa taaaatgagg aaattgcatc 
ggggsgtggg gtggggcagg acagcaaggg 
tggggtaaaa aagaaaaaag ggtggactgg 
agaatgctta tggactaggg actgtttacg 
gactcatagt taaagcgcta gcagctgcct 
gctaatgacg tataagttgt tccattgtaa 
gaggagtctc tttgttgagg acttttgagt 
atatttgaga ttgaaccctg tcgagtatct 
aatccgggcc gagaacttcg cagcggccgc 
catcacgaga tttcgattcc accgccgcct 
tccgggacgc cggctggatg atcctccagc 
accccaactt gtttattgca gcttataatg 
tcacaaataa agcatttttt tcactgcatt 
tatcttatca tgtctggatc ccgtcgacct 
gtttcctgtg tgaaattgtt atccgctcac 
aaagtgtaaa gcctggggtg cctaatgagt 
actgcccgct ttccagtcgg gaaacctgtc 
cgcggggaga ggcggtttgc gtattgggcg 
gcgctcggtc gttcggctgc ggcgagcggt 



gcacacagtt cacttgtctt ctctttttcc 3360 
cataataccc cagtggggac caccgcattc 3420 
cctttttgtc ccagactgga aatccattac 3480 
ctctgtgtgg cttgttctgt gggcctacct 3540 
cgaggcgact aagatgctag taggggttgt 3600 
gatggatcag ttcagctttc aaataaaaag 3660 
acccctgtgg tgggcttctc catctgtctg 3720 
gggacgcctg cttccctcga gttggctgga 3780 
gttgccagct tggtcctgga tcgcccgggc 3840 
cacccgcgct cgtacgtgcg tctccgcctg 3900 
acatgcgctc gctcggctct ataggcgccg 3960 
gccgcagccg ccgccactcc tgctctctct 4020 
ccggctgagt ctgcagtcct cgaggaactg 4080- 
tctttttgtc ttttatttca ggtcccggat 4140 
ctcagtggat gttgccttta cttctaggcc 4200 
ctgcggaatt gtacccgcgg ccaagctaag 4260 
gaactactaa ggtgggaggg ggctatacgc 432 0 
ccctccgcct gacgctcctc tctgtctcag 43~80 
agctgtggca gcggcgaaag gaagcggctg 4440 
tcctggctac ctcgcacagc ggtgcccgcc 4500 
cccccacgaa cgccagcaat tgcactgatg 4560 
ccagccccgg ttcctgggtc aacttgtccc 4620 
gtccgaaccg caccgacctg ggcgggagag 4680 
ccatgatcac ggccatcacg atcatggccc 4740 
tcggaaactt cctggtcatg tatgtgattg 4800 
acatctacat tttcaacctt gctctggcag 4860 
agagtgtgaa ttacctaatg ggaacatggc 4920 
tctccataga ttactataac atgttcacca 4980 
atcgatacat tgcagtctgc caccctgtca 5040 
ccaaaattat caatgtctgc aactggatcc 5100 
tcatggctac aacaaaatac aggcaaggtt 5160 
caacctggta ctgggaaaac ctgctgaaga 5220 
cagtgctcat cattaccgtg tgctatggac 5280 
tgctctctgg ctccaaagaa aaggacagga 5340 
tggtggtggc tgtgttcatc gtctgctgga 5400 
ccttggttac aatcccagaa actacgttcc 5460 
taggttacac aaacagctgc ctcaacccag 5520 
aacgatgctt cagagagttc tgtatcccaa 5580 
ctcgaattcg tcagaacatc tagagaccac 5640 
aatcatcagc tagaaaatct ggaagcagaa 5700 
agcctcgact gtgccttcta gttgccagcc 5760 
cttgaccctg gaaggtgcca ctcccactgt 5820 
gcattgtctg agtaggtgtc attctattct 5880 
ggaggattgg gaagacaata gcaggcatgc 5940 
gatgagtatt ggaaccctga agaaatagaa 6000 
aacaaatgat aaaaggaaat agctgagcat 6060 
aaccgcaaaa ccacatccta tggaaagctt 6120 
gagtatataa ccagtgcttt gtgaaacttc 6180 
tctcccttga ggctcccaca gatacaataa 6240 
gtgtaatctt ttttacctgt gaggtctcgg 6300 
tcatgaccga ccaagcgacg cccaacctgc 6360 
tctatgaaag gttgggcttc ggaatcgttt 6420 
gcggggatct catgctggag ttcttcgccc 6480 
gttacaaata aagcaatagc atcacaaatt 6540 
ctagttgtgg tttgtccaaa ctcatcaatg 6600 
cgagagcttg gcgtaatcat ggtcatagct 6660 
aattccacac aacatacgag ccggaagcat 6720 
gagctaactc acattaattg cgttgcgctc 6780 
gtgccagctg cattaatgaa tcggccaacg 6840 
ctcttccgct tcctcgctca ctgactcgct 6900 
atcagctcac tcaaaggcgg taatacggtt 6960 
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atccacagaa 
caggaaccgt 
gcatcacaaa 
ccaggcgttt 
cggatacctg 
taggtatctc 
cgttcagccc 
acacgactta 
aggcggtgct 
atttggtatc 
atccggcaaa 
gcgcagaaaa 
gtggaacgaa 
ctagatcctt 
ttggtctgac 
tcgttcatcc 
accatctggc 
atcagcaata 
cgcctccatc 
tagtttgcgc 
tatggcttca 
gtgcaaaaaa 
agtgttatca 
aagatgcttt 
gcgaccgagt 
tttaaaagtg 
gctgttgaga 
tactttcacc 
aataagggcg 
catttatcag 
acaaataggg 
agatctcccg 
agccagtatc 
taagctacaa 
cgttttgcgc 
agttattaat 
gttacataac 
acgtcaataa 
tgggtggact 
agtacgcccc 
atgaccttat 
atggtg 



tcaggggata 
aaaaaggccg 
aatcgacgct 
ccccctggaa 
tccgcctttc 
agttcggtgt 
gaccgctgcg 
tcgccactgg 
acagag.ttct 
tgcgctctgc 
caaaccaccg 
aaaggatctc 
aactcacgtt 
ttaaattaaa 
agttaccaat 
atagttgcct 
cccagtgctg 
aaccagccag 
cagtctatta 
aacgttgttg 
ttcagctccg 
gcggttagct 
ctcatggtta 
tctgtgactg 
tgctcttgcc 
ctcatcattg 
tccagttcga 
agcgtttctg 
acacggaaat^ 
ggttattgtc 
gttccgcgca 
atcccctatg 
tgctccctgc 
caaggcaagg 
tgcttcgcga 
agtaatcaat 
ttacggtaaa 
tgacgtatgt 
atttacggta 
ctattgacgt 
gggactttcc 



acgcaggaaa 
cgttgctggc 
caagtcagag 
gctccctcgt 
tcccttcggg 
aggtcgttcg 
ccttatccgg 
cagcagccac 
tgaagtggtg 
tgaagccagt 
ctggtagcgg 
aagaagatcc 
aagggatttt 
aatgaagttt 
gcttaatcag 
gactccccgt 
caatgatacc 
ccggaagggc 
attgttgccg 
ccattgctac 
gttcccaacg 
ccttcggtcc 
tggcagcact 
gtgagtactc 
cggcgtcaat 
gaaaacgttc 
tgtaacccac 
ggtgagcaaa 
gttgaatact 
tcatgagcgg 
catttccccg 
gtcgactctc 
ttgtgtgttg 
cttgaccgac 
tgtacgggcc 
tacggggtca 
tggcccgcct 
tcccatagta 
aactgcccac 
caatgacggt 
tacttggcag 



gaacatgtga 
gtttttccat 
gtggcgaaac 
gcgctctcct 
aagcgtggcg 
ctccaagctg 
taactatcgt 
tggtaacagg 
gcctaactac 
taccttcgga 
tggttttttt 
tttgatcttt 
ggtcatgaga 
taaatcaatc 
tgaggcacct 
cgtgtagata 
gcgagaccca 
cgagcgcaga 
ggaagctaga 
aggcatcgtg 
atcaaggcga 
tccgatcgtt 
gcataattct 
aaccaagtca 
acgggataat 
ttcggggcga 
tcgtgcaccc 
aacaggaagg 
catactcttc 
atacatattt 
aaaagtgcca 
agtacaatct 
gaggtcgctg 
aattgcatga 
agatatacgc 
ttagttcata 
ggctgaccgc 
acgccaatag 
ttggcagtac 
aaatggcccg 
tacatctacg 



gcaaaaggcc 
aggctccgcc 
ccgacaggac 
gttccgaccc 
ctttctcaat 
ggctgtgtgc 
cttgagtcca 
attagcagag 
ggctacacta 
aaaagagttg 
gtttgcaagc 
tctacggggt 
ttatcaaaaa 
taaagtatat 
atctcagcga 
actacgatac 
cgctcaccgg 
agtggtcctg 
gtaagtagtt 
gtgtcacgct 
gttacatgat 
gtcagaagta 
cttactgtca 
ttctgagaat 
accgcgccac 
aaactctcaa 
aactgatctt 
caaaatgccg 
ctttttcaat 
gaatgtattt 
cctgacgtcg 
gctctgatgc 
agtagtgcgc 
agaatctgct 
gttgacattg 
gcccatatat 
ccaacgaccc 
ggactttcca 
atcaagtgta 
cctggcatta 
tattagtcat 



agcaaaaggc 
cccctgacga 
tataaagata 
tgccgcttac 
gctcacgctg 
acgaaccccc 
acccggtaag 
cgaggtatgt 
gaaggacagt 
gtagctcttg 
agcagattac 
ctgacgctca 
ggatcttcac 
atgagtaaac 
tctgtctatt 
gggagggctt 
ctccagattt 
caactttatc 
cgccagttaa 
cgtcgtttgg 
cccccatgtt 
agttggccgc 
tgccatccgt 
agtgtatgcg 
atagcagaac 
ggatcttacc 
cagcatcttt 
caaaaaaggg 
attattgaag 
agaaaaataa 
acggatcggg 
cgcatagtta 
gagcaaaatt 
tagggttagg 
attattgact 
ggagttccgc 
ccgcccattg 
ttgacgtcaa 
tcatatgcca 
tgcccagtac 
cgctattacc 



7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9426 



<210> 8 
<211> 12745 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 8 

atgcggtttt ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca 60 

agtctccacc ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt 120 

ccaaaatgtc gtaacaactc cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg 180 

gaggtctata taagcagagc tctgtgaaac ttcgaggagt ctctttgttg aggacttttg 240 

agttctccct tgaggctccc acagatacaa taaatatttg agattgaacc ctgtcgagta 300 

tctgtgtaat cttttttacc tgtgaggtct cggaatccgg gccgagaact tcgcagttgg 360 



8/53 



WO 2004/073646 



PCT/US2004/004914 



cgcccgaaca gggacttgat tgagagtgat 
tgttaagcag aactcctgct gacctaaata 
agtatctcta gtgaagcgga ctcgagctca 
aattacatct ggtgactctt cgcggacctt 
acaaggtagg agagattcta cagcaacatg 
gccattaaga gatgtagtaa tgttgctgta 
gaagggaatt tcagatgggc cattagaatg 
gatataccag agactttaga tcaactaagg 
gaaaaatttg gatctagcaa agaaattgat 
gtagcaggac ttttaaatat gacgggtgtc 
tcaaatggga ttagacacta ggccatctat 
tccacaggca tatcctattc aaacagtaaa 
aaaaatggtg tccattttta tggaaaaggc 
actatggttt actgccttct ctgcaaattt 
ggccgcacca gggtgcgctg cagataaaga 
agcagaatat gatcgcacac atccccctga 
agcagaaatt atgggtatag gattaactca 
agctaggatg cagtgtagag catggtatct 
agctaagtct cctcgagctg tgcagttaag 
tatagacaga ttgtttgccc aaatagatca 
tttaaaacag tcattgagca tagctaatgc 
ccttaagcca gaaagtaccc tagaagaaaa 
aggatataaa atgcaactct tggcagaagc 
aggatcagga ccagtgtgtt ttaattgtaa 
agaagtgaaa aaatgtaata aatgtggaaa 
aggaaataga aagaattgta caagggaaga 
attgggtagg atggatagga aatattccac 
tgggaatagg attaggagtg ttattattga 
taagaaattg tatccacaag atactaggat 
gagaagaaat acaaccacaa atggaattga 
aaaaagagga ggaatgatga agtatctcag 
gttcttccct ttgaggaagg tatgtcatat 
aagtatgtat tgtaaggtaa aaggaaaaga 
agtacattta tattggctca tgtccaatat 
gttattaata gtaatcaatt acggggtcat 
ttacataact tacggtaatt ggcccgcctg 
gtcaataatg acgtatgttc ccatagtaac 
ggtggagtat ttacggtaaa ctgcccactt 
tccggccccc tattgacgtc aatgacggta 
tgaccttacg ggactttggt acttggcagt 
tggtgatgcg gttttggcag tacaccaatg 
ttccaagtct ccaccccatt gacgtcaatg 
actttccaaa atgtcgtaat aaccccgccc 
ggtgggaggt ctatataagc agagctcgtt 
atccacgctg ttttgacctc catagaagac 
aacggtgcat tggaacgcgg attccccgtg 
tctataggca cacccctttg gctcttatgc 
acacccccgc tccttatgct ataggtgatg 
gaccattatt gaccactccc ctattggtga 
gctctttgcc acaactatct ctattggcta 
cacggactct gtatttttac aggatggggt 
aacaacgccg tcccccgtgc ccgcagtttt 
atctcgggta cgtgttccgg acatgggctc 
gagccctggt cccatgcctc cagcggctca 
gtggaggcca gacttaggca cagcacaatg 
gtggcggtag ggtatgtgtc tgaaaatgag 
atggaagact taaggcagcg gcagaagaag 
aagagtcaga ggtaactccc gttgcggttc 
cagtactcgt tgctgccgcg cgcgccacca 
tcctttccat gggtcttttc tgcagtcacc 
ttcactggcc gtcgttttac aacgtcgtga 



tgaggaagtg aagctagagc aatagaaagc 420 
gggaagcagt agcagacgct gctaacagtg 480 
taatcaagtc attgtttaaa ggcccagata 540 
caagccagga gattcgccga gggacagtca 600 
gggaatggac aggggcgaga ttggaaaatg 660 
ggagtagggg ggaagagtaa aaaatttgga 720 
gctaatgtat ctacaggacg agaacctggt 780 
ttggttattt gcgatttaca agaaagaaga 840 
atggcaattg tgacattaaa agtctttgcg 900 
tactgctgct gcagctgaaa atatgtattc 960 
gaaagaagca ggtggaaaag aggaaggccc 1020 
tggagtacca caatatgtag cacttgaccc 1080 
aagagaagga ctaggaggtg aggaagttca 1140 
aacacctact gacatggcca cattaataat 1200 
aatattggat gaaagcttaa agcaactgac 1260 
tgctcccaga ccattaccct attttactgc 1320 
agaacaacaa gcagaagcaa gatttgcacc 1380 
cgaggcatta ggaaaattgg ctgccataaa 1440 
acaaggagct aaggaagatt attcatcctt 1500 
agaacaaaat acagctgaag ttaagttata 1560 
taatgcagac tgtaaaaagg caatgagcca 1620 
gttgagagct tgtcaagaaa taggctcacc 1680 
tcttacaaaa gttcaagtag tgcaatcaaa 1740 
aaaaccagga catctagcaa gacaatgtag 1800 
acctggtcat gtagctgcca aatgttggca 1860 
aagggataca acaattacaa aagtgggaag 1920 
aatatttaaa gggactattgggaggtatct 1980 
ttttatgttt acctacattg gttgattgta 2040 
acacagtaat tgcaatgcct gaagtagaag 2100 
ggagaaatgg taggcaatgt ggcatgtctg 2160 
acttatttta taagggagat actgtgctga 2220 
gaatccattt cgaatcaaat caaactaata 2280 
caaagaagaa gaagaaagaa gaaagccttc 2340 
gaccgccatg ttgacattga ttattgacta 2400 
tagttcatag cccatatatg gagttccgcg 2460 
ctgaccgccc aacgaccccc gcccattgac 2520 
gccaataggg actttccatt gacgtcaatg 2580 
ggcagtacat caagtgtatc atatgccaag 2640 
aatggcccgc ctggcattat gcccagtaca 2700 
acatctacgt attagtcatc gctattacca 2760 
ggcgtggata gcggtttgac tcacggggat 2820 
ggagtttgtt ttggcaccaa aatcaacggg 2880 
cgttgacgca aatgggcggt aggcgtgtac 2940 
tagtgaaccg tcagatcgcc tggagacgcc 3000 
accgggaccg atccagcctc cgcggccggg 3060 
ccaagagtga cgtaagtacc gcctatagac 3120 
atgctatact gtttttggct tggggcctat 3180 
gtatagctta gcctataggt gtgggttatt 3240 
cgatactttc cattactaat ccataacatg 3300 
tatgccaata ctctgtcctt cagagactga 3360 
cccatttatt atttacaaat tcacatatac 3420 
tattaaacat agcgtgggat ctccacgcga 3480 
ttctccggta gcggcggagc ttccacatcc 3540 
tggtcgctcg gcagctcctt gctcctaaca 3600 
cccaccacca ccagtgtgcc gcacaaggcc 3660 
ctcggagatt gggctcgcac cgtgacgcag 3720 
atgcaggcag ctgagttgtt gtattctgat 3780 
tgttaacggt ggagggcagt gtagtctgag 3840 
gacataatag ctgacagact aacagactgt 3900 
gtcgtcgaag cttatgacca tgattacgga 3960 
ctgggaaaac cctggcgtta cccaacttaa 4020 
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tcgccttgca gcacatcccc ctttcgccag 
tcgcccttcc caacagttgc gcagcctgaa 
accagaagcg gtgccggaaa gctggctgga 
cgtcccctca aactggcaga tgcacggtta 
tcccattacg gtcaatccgc cgtttgttcc 
cacatttaat gttgatgaaa gctggctaca 
cgttaactcg gcgtttcatc tgtggtgcaa 
tcgtttgccg tctgaatttg acctgagcgc 
ggtgatggtg ctgcgttgga gtgacggcag 
gagcggcatt ttccgtgacg tctcgttgct 
ccatgttgcc actcgcttta atgatgattt 
gatgtgcggc gagttgcgtg actacctacg 
gcaggtcgcc agcggcaccg cgcctttcgg 
tgccgatcgc gtcacactac gtctgaacgt 
cccgaatctc tatcgtgcgg tggttgaact 
agaagcctgc gatgtcggtt tccgcgaggt 
cggcaagccg ttgctgattc gaggcgttaa 
ggtcatggat gagcagacga tggtgcagga 
cgccgtgcgc tgttcgcatt atccgaacca 
cggcctgtat gtggtggatg aagccaatat 
tctgaccgat gatccgcgct ggctaccggc 
gcgcgatcgt aatcacccga gtgtgatcat 
cgctaatcac gacgcgctgt atcgctggat 
gtatgaaggc ggcggagccg acaccacggc 
cgtggatgaa gaccagccct tcccggctgt 
gctacctgga gagacgcgcc cgctgatcct 
tcttggcggt ttcgctaaat : actggcaggc 
cttcgtctgg gactgggtgg atcagtcgct 
gtcggcttac ggcggtgatt ttggcgatac 
tctggtcttt gccgaccgca cgccgcatcc 
gtttttccag ttccgtttat ccgggcaaac 
tcatagcgat aacgagctcc tgcactggat 
cggtgaagtg cctctggatg tcgctccaca 
accgcagccg gagagcgccg ggcaactctg 
gaccgcatgg tcagaagccg ggcacatcag 
cctcagtgtg acgctccccg ccgcgtccca 
ggatttttgc atcgagctgg gtaataagcg 
ttcacagatg tggattggcg ataaaaaaca 
ccgtgcaccg ctggataacg acattggcgt 
ctgggtcgaa cgctggaagg cggcgggcca 
cacggcagat acacttgctg atgcggtgct 
ggggaaaacc ttatttatca gccggaaaac 
gattaccgtt gatgttgaag tggcgagcga 
ctgccagctg gcgcaggtag cagagcgggt 
ctatcccgac cgccttactg ccgcctgttt 
gtataccccg tacgtcttcc cgagcgaaaa 
ttatggccca caccagtggc gcggcgactt 
gcaactgatg gaaaccagcc atcgccatct 
tatcgacggt ttccatatgg ggattggtgg 
ggaattccag ctgagcgccg gtcgctacca 
cgatcgacca gagctgagat cctacaggag 
gatgatgaca gagttagaag atcgcttcag 
agacagcaca gtagattctg aagatgaacc 
tgagtattgg aaccctgaag aaatagaaag 
caaatgataa aaggaaatag ctgagcatga 
ccgcaaaacc acatcctatg gaaagcttgc 
gtatataacc agtgctttgt gaaacttcga 
tcccttgagg ctcccacaga tacaataaat 
gtaatctttt ttacctgtga ggtctcggaa 
gagcatgcat ctagagggcc ctattctata 
agcctcgact gtgccttcta gttgccagcc 



ctggcgtaat agcgaagagg cccgcaccga 4080 
tggcgaatgg cgctttgcct ggtttccggc 4140 
gtgcgatctt cctgaggccg atactgtcgt 4200 
cgatgcgccc atctacacca acgtaaccta 4260 
cacggagaat ccgacgggtt gttactcgct 4320 
ggaaggccag acgcgaatta tttttgatgg 4380 
cgggcgctgg gtcggttacg gccaggacag 4440 
atttttacgc gccggagaaa accgcctcgc 4500 
ttatctggaa gatcaggata tgtggcggat 4560 
gcataaaccg actacacaaa tcagcgattt 4620 
cagccgcgct gtactggagg ctgaagttca 4680 
ggtaacagtt tctttatggc agggtgaaac 4740 
cggtgaaatt atcgatgagc gtggtggtta 4800 
cgaaaacccg aaactgtgga gcgccgaaat 4860 
gcacaccgcc gacggcacgc tgattgaagc 4920 
gcggattgaa aatggtctgc tgctgctgaa 4980 
ccgtcacgag catcatcctc tgcatggtca 5040 
tatcctgctg atgaagcaga acaactttaa 5100 
tccgctgtgg tacacgctgt gcgaccgcta 5160 
tgaaacccac ggcatggtgc caatgaatcg 5220 
gatgagcgaa cgcgtaacgc gaatggtgca 5280 
ctggtcgctg gggaatgaat caggccacgg 5340 
caaatctgtc gatccttccc gcccggtgca 5400 
caccgatatt atttgcccga tgtacgcgcg 5460 
gccgaaatgg tccatcaaaa aatggctttc 5520 
ttgcgaatac gcccacgcga tgggtaacag 5580 
gtttcgtcag tatccccgtt tacagggcgg 5640 
gattaaatat gatgaaaacg gcaacccgtg 5700 
gccgaacgat cgccagttct gtatgaacgg 5760 
agcgctgacg gaagcaaaac accagcagca 5820 
catcgaagtg accagcgaat acctgttccg 5880 
ggtggcgctg gatggtaagc cgctggcaag 5940 
aggtaaacag ttgattgaac tgcctgaact 6000 
gctcacagta cgcgtagtgc aaccgaacgc 6060 
cgcctggcag cagtggcgtc tggcggaaaa 6120 
cgccatcccg catctgacca ccagcgaaat 6180 
ttggcaattt aaccgccagt caggctttct 6240 
actgctgacg ccgctgcgcg atcagttcac 6300 
aagtgaagcg acccgcattg accctaacgc 6360 
ttaccaggcc gaagcagcgt tgttgcagtg 6420 
gattacgacc gctcacgcgt ggcagcatca 6480 
ctaccggatt gatggtagtg gtcaaatggc 6540 
tacaccgcat ccggcgcgga ttggcctgaa 6600 
aaactggctc ggattagggc cgcaagaaaa 6660 
tgaccgctgg gatctgccat tgtcagacat 6720 
cggtctgcgc tgcgggacgc gcgaattgaa 6780 
ccagttcaac atcagccgct acagtcaaca 6840 
gctgcacgcg gaagaaggca catggctgaa 6900 
cgacgactcc tggagcccgt cagtatcggc 6960 
ttaccagttg gtctggtgtc aaaaataact 7020 
tccagggctg gagagaaaac ctctgaagag 7080 
gaagctattt ggcacgactt ctacaacggg 7140 
tcctaaaaaa gaaaaaaggg tggactggga 7200 
aatgcttatg gactagggac tgtttacgaa 7260 
ctcatagtta aagcgctagc agctgcctaa 7320 
taatgacgta taagttgttc cattgtaaga 7380 
ggagtctctt tgttgaggac ttttgagttc 7440 
atttgagatt gaaccctgtc gagtatctgt 7500 
tccgggccga gaacttcgca gcggccgctc 7560 
gtgtcaccta aatgctagag ctcgctgatc 7620 
atctgttgtt tgcccctccc ccgtgccttc 7680 
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cttgaccctg gaaggtgcca ctcccactgt 
gcattgtctg agtaggtgtc attctattct 
ggaggattgg gaagacaata gcaggcatgc 
ggcggaaaga accagctggg gctcgagggg 
aagcgcggcg ggtgtggtgg ttacgcgcag 
gcccgctcct ttcgctttct tcccttcctt 
agctctaaat cggggcatcc ctttagggtt 
caaaaaactt gattagggtg atggttcacg 
tcgccctttg acgttggagt ccacgttctt 
aacactcaac cctatctcgg tctattcttt 
ctattggtta aaaaatgagc tgatttaaca 
aacgtttaca atttaaatat ttgcttatac 
atcaaccggg gtgggtaccg agctcgaatt 
aagtccccag gctccccagg caggcagaag 
aaccaggtgt ggaaagtccc caggctcccc 
caattagtca gcaaccatag tcccgcccct 
cagttccgcc cattctccgc cccatggctg 
ggccgcctcg gcctctgagc tattccagaa 
cttttgcaaa aagctcccgg gagcttggat 
atgaggatcg tttcgcatga ttgaacaaga 
ggtggagagg ctattcggct atgactgggc 
cgtgttccgg ctgtcagcgc aggggcgccc 
tgccctgaat gaactgcagg acgaggcagc 
tccttgcgca gctgtgctcg acgttgtcac 
cgaagtgccg gggcaggatc tcctgtcatc 
catggctgat gcaatgcggc ggctgcatac 
ccaagcgaaa catcgcatcg agcgagcacg 
ggatgatctg gacgaagagc atcaggggct 
ggcgcgcatg cccgacggcg aggatctcgt 
tatcatggtg gaaaatggcc gcttttctgg 
ggaccgctat caggacatag cgttggctac 
atgggctgac cgcttcctcg tgctttacgg 
cttctatcgc cttcttgacg agttcttctg 
caagcgacgc ccaacctgcc atcacgagat 
ttgggcttcg gaatcgtttt ccgggacgcc 
atgctggagt tcttcgccca ccccaacttg 
agcaatagca tcacaaattt cacaaataaa 
ttgtccaaac tcatcaatgt atcttatcat 
cgtaatcatg gtcatagctg tttcctgtgt 
acatacgagc cggaagcata aagtgtaaag 
cattaattgc gttgcgctca ctgcccgctt 
attaatgaat cggccaacgc gcggggagag 
cctcgctcac tgactcgctg cgctcggtcg 
caaaggcggt aatacggtta tccacagaat 
caaaaggcca gcaaaaggcc aggaaccgta 
ggctccgccc ccctgacgag catcacaaaa 
cgacaggact ataaagatac caggcgtttc 
ttccgaccct gccgcttacc ggatacctgt 
tttctcaatg ctcacgctgt aggtatctca 
gctgtgtgca cgaacccccc gttcagcccg 
ttgagtccaa cccggtaaga cacgacttat 
ttagcagagc gaggtatgta ggcggtgcta 
gctacactag aaggacagta tttggtatct 
aaagagttgg tagctcttga tccggcaaac 
tttgcaagca gcagattacg cgcagaaaaa 
ctacggggtc tgacgctcag tggaacgaaa 
tatcaaaaag gatcttcacc tagatccttt 
aaagtatata tgagtaaact tggtctgaca 
tctcagcgat ctgtctattt cgttcatcca 
ctacgatacg ggagggctta ccatctggcc 
gctcaccggc tccagattta tcagcaataa 



cctttcctaa taaaatgagg aaattgcatc 7740 
999999 tggg gtggggcagg acagcaaggg 7800 
tggggatgcg gtgggctcta tggcttctga 7860 
ggatccccac gcgccctgta gcggcgcatt 7920 
cgtgaccgct acacttgcca gcgccctagc 7980 
tctcgccacg ttcgccggct ttccccgtca 8040 
ccgatttagt gctttacggc acctcgaccc 8100 
tagtgggcca tcgccctgat agacggtttt 8160 
taatagtgga ctcttgttcc aaactggaac 8220 
tgatttataa gggattttgg ggatttcggc 8280 
aaaatttaac gcgaatttta acaaaatatt 8340 
aatcttcctg tttttggggc ttttctgatt 8400 
ctgtggaatg tgtgtcagtt agggtgtgga 8460 
tatgcaaagc atgcatctca attagtcagc 8520 
agcaggcaga agtatgcaaa gcatgcatct 8580 
aactccgccc atcccgcccc taactccgcc 8640 
actaattttt tttatttatg cagaggccga 8700 
gtagtgagga ggcttttttg gaggcctagg 8760 
atccattttc ggatctgatc aagagacagg 8820 
tggattgcac gcaggttctc cggccgcttg 8880 
acaacagaca atcggctgct ctgatgccgc 8940 
ggttcttttt gtcaagaccg acctgtccgg 9000 
gcggctatcg tggctggcca cgacgggcgt 9060 
tgaagcggga agggactggc tgc tat tggg 9120 
tcaccttgct cctgccgaga aagtatccat 9180 
gcttgatccg gctacctgcc cattcgacca 9240 
tactcggatg gaagceggtc ttgtcgatca 9300 
cgcgccagcc gaactgttcg ccaggctcaa 9360 
cgtgacccat ggcgatgcct gcttgccgaa 9420 
attcatcgac tgtggccggc tgggtgtggc 9480 
ccgtgatatt gctgaagagc ttggcggcga 9540 
tatcgccgct cccgattcgc agcgcatcgc 9600 
agcgggactc tggggttcga aatgaccgac 9660 
ttcgattcca ccgccgcctt ctatgaaagg 9720 
ggctggatga tcctccagcg cggggatctc 9780 
tttattgcag cttataatgg ttacaaataa 9840 
gcattttttt cactgcattc tagttgtggt 9900 
gtctggatcc cgtcgacctc gagagcttgg 9960 
gaaattgtta tccgctcaca attccacaca 10020 
cctggggtgc ctaatgagtg agctaactca 10080 
tccagtcggg aaacctgtcg tgccagctgc 10140 
gcggtttgcg tattgggcgc tcttccgctt 10200 
ttcggctgcg gcgagcggta tcagctcact 10260 
caggggataa cgcaggaaag aacatgtgag 10320 
aaaaggccgc gttgctggcg tttttccata 10380 
atcgacgctc aagtcagagg tggcgaaacc 10440 
cccctggaag ctccctcgtg cgctctcctg 10500 
ccgcctttct cccttcggga agcgtggcgc 10560 
gttcggtgta ggtcgttcgc tccaagctgg 1062 0 
accgctgcgc cttatccggt aactatcgtc 10680 
cgccactggc agcagccact ggtaacagga 10740 
cagagttctt gaagtggtgg cctaactacg 10800 
gcgctctgct gaagccagtt accttcggaa 10860 
aaaccaccgc tggtagcggt ggtttttttg 10920 
aaggatctca agaagatcct ttgatctttt 10980 
actcacgtta agggattttg gtcatgagat 11040 
taaattaaaa atgaagtttt aaatcaatct 11100 
gttaccaatg cttaatcagt gaggcaccta 11160 
tagttgcctg actccccgtc gtgtagataa 11220 
ccagtgctgc aatgataccg cgagacccac 11280 
accagccagc cggaagggcc gagcgcagaa 11340 
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gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg gaagctagag 11400 
taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca ggcatcgtgg 11460 
tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga tcaaggcgag 11520 
ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct ccgatcgttg 11580 
tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg cataattctc 11640 
ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca accaagtcat 11700 
tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata cgggataata 11760 
ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct tcggggcgaa 11820 
aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact cgtgcaccca 11880 
actgatcttc agcatcttfct actttcacca gcgtttctgg gtgagcaaaa acaggaaggc 11940 
aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc atactcttcc 12000 
tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga tacatatttg 12060 
aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga aaagtgccac 12120 
ctgacgtcga cggatcggga gatctcccga tcccctatgg tcgactctca gtacaatctg 12180 
ctctgatgcc gcatagttaa gccagtatct gctccctgct tgtgtgttgg aggtcgctga 12240 
gtagtgcgcg agcaaaattt aagctacaac aaggcaaggc ttgaccgaca attgcatgaa 12300 
gaatctgctt agggttaggc gttttgcgct gcttcgcgat gtacgggcca gatatacgcg 12360 
ttgacattga ttattgacta gttattaata gtaatcaatt acggggtcat tagttcatag 12420 
cccatatatg gagttccgcg ttacataact tacggtaaat ggcccgcctg gctgaccgcc 12480 
caacgacccc cgcccattga cgtcaataat gacgtatgtt cccatagtaa cgccaatagg 12540 
gactttccat tgacgtcaat gggtggacta tttacggtaa actgcccact tggcagtaca 12600 
tcaagtgtat catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc 12660 
ctggcattat gcccagtaca tgaccttatg ggactttcct acttggcagt acatctacgt 12720 
attagtcatc gctattacca tggtg 12745 



<210> 9 
<211> 1200 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 9 

atggacagca gcgctgcccc cacgaacgcc 
agttgctccc cagcacccag ccccggttcc 
ctgtccgacc catgcggtcc gaaccgcacc 
ccgaccggca gtccctccat gatcacggcc 
tgcgtggtgg ggctcttcgg aaacttcctg 
atgaagactg ccaccaacat ctacattttc 
agtaccctgc ccttccagag tgtgaattac 
ctttgcaaga tagtgatctc catagattac 
tgcaccatga gtgttgatcg atacattgca 
cgtactcccc gaaatgccaa aattatcaat 
ggtcttcctg taatgttcat ggctacaaca 
ctaacattct ctcatccaac ctggtactgg 
ttcgccttca ttatgccagt gctcatcatt 
ctcaagagtg tccgcatgct ctctggctcc 
accaggatgg tgctggtggt ggtggctgtg 
tacgtcatca ttaaagcctt ggttacaatc 
cacttctgca ttgctctagg ttacacaaac 
ctggatgaaa acttcaaacg atgcttcaga 
gagcaacaaa actccactcg aattcgtcag 
acagtggata gaactaatca tcagctagaa 



<210> 10 
<211> 400 



agcaattgca ctgatgcctt ggcgtactca 60 
tggatcaact tgtcccactt agatggcaac 120 
gacctgggcg ggagagacag cctgtgccct 180 
atcacgatca tggccctcta ctccatcgtg 240 
gtcatgtatg tgattgtcag atacaccaag 300. 
aaccttgctc tggcagatgc cttagccacc 360 
ctaatgggaa catggccatt tggaaccatc 420 
tataacatgt tcaccagcat attcaccctc 480 
gtctgccacc ctgtcaaggc cttagatttc 540 
gtctgcaact ggatcctctc ttcagccatt 600 
aaatacaggc aaggttccat agattgtaca 660 
gaaaacctgc tgaagatctg tgttttcatc 720 
accgtgtgct atggactgat gatcttgcgc 780 
aaagaaaagg acaggaatct tcgaaggatc 840 
ttcatcgtct gctggactcc cattcacatt 900 
ccagaaacta cgttccagac tgtttcttgg 960 
agctgcctca acccagtcct ttatgcattt 1020 
gagttctgta tcccaacctc ttccaacatt 1080 
aacactagag accacccctc cacggccaat 1140 
aatctggaag cagaaactgc tccgttgccc 1200 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 10 



Met 


Asp 


Ser 


Ser 


Ala Ala 


Pro 


Thr 


Asn 


Ala 


Ser 


Asn 


Cys Thr Asp 


Ala 


1 








5 








10 










15 




Leu 


Ala 


Tyr 


Ser 


Ser Cys 


Ser 


Pro 


Ala 


Pro 


Ser 


Pro 


Gly Ser Trp 


He 








20 








25 










30 






Asn 


Leu 


Ser 


His 


Leu Asp 


Gly 


Asn 


Leu 


Ser 


Asp 


Pro 


Cys 


Gly Pro 


Asn 






35 








40 










45 








Arg 


Thr 


Asp 


Leu 


Gly Gly 


Arg 


Asp 


Ser 


Leu 


Cys 


Pro 


Pro Thr Gly 


Ser 




50 








55 










60 










Pro 


Ser 


Met 


He 


Thr Ala 


He 


Thr 


He 


Met 


Ala 


Leu 


Tyr 


Ser 


He 


Val 


65 








70 










75 










80 


Cys 


Val 


Val 


Gly 


Leu Phe 


Gly 


Asn 


Phe 


Leu 


Val 


Met 


Tyr Val 


He 


Val 










85 








90 










95 




Arg 


Tyr 


Thr 


Lys 
100 


Met Lys 


Thr 


Ala 


Thr 
105 


Asn 


He 


Tyr 


He 


Phe 
110 


Asn 


Leu 


Ala 


Leu 


Ala 
115 


Asp 


Ala Leu 


Ala 


Thr 
120 


Ser 


Thr 


Leu 


Pro 


Phe 
125 


Gin 


Ser 


Val 


Asn 


Tyr 


Leu 


Met 


Gly Thr 


Trp 


Pro 


Phe 


Gly 


Thr 


lie 


Leu Cys 


Lys 


He 




130 








135 










140 










Val 


He 


Ser 


He 


Asp Tyr 


Tyr 


Asn 


Met 


Phe 


Thr 


Ser 


He 


Phe 


Thr 


Leu 


145 








150 










155 










160 


Cys 


Thr 


Met 


Ser 


Val Asp 
165 


Arg 


Tyr 


lie 


Ala 
170 


Val 


Cys 


His 


Pro 


Val 
175 


Lys 


Ala 


Leu 


Asp 


Phe 
180 


Arg Thr 


Pro 


Arg 


Asn 
185 


Ala 


Lys 


He 


He 


Asn 
190 


Val 


Cys 


Asn 


Trp 


He 
195 


Leu 


Ser Ser 


Ala 


He 
200 


Gly 


Leu 


Pro 


Val 


Met 
205 


Phe 


Met 


Ala 


Thr 


Thr 
210 


Lys 


Tyr 


Arg Gin 


Gly 
215 


Ser 


He 


Asp 


Cys 


Thr 
220 


Leu 


Thr 


Phe 


Ser 


His 


Pro 


Thr 


Trp 


Tyr Trp 


Glu 


Asn 


Leu 


Leu 


Lys 


He 


Cys 


Val 


Phe 


He 


225 








230 










235 










240 


Phe 


Ala 


Phe 


He 


Met Pro 
245 


Val 


Leu 


He 


He 
250 


Thr 


Val 


Cys 


Tyr 


Gly 
255 


Leu 


Met 


He 


Leu 


Arg 


Leu Lys 


Ser 


Val 


Arg 


Met 


Leu 


Ser 


Gly Ser Lys 


Glu 








260 






265 










270 






Lys 


Asp 


Arg 
275 


Asn 


Leu Arg 


Arg 


He 
280 


Thr 


Arg 


Met 


Val 


Leu 
285 


Val 


Val 


Val 


Ala 


Val 


Phe 


He 


Val Cys 


Trp 


Thr 


Pro 


He 


His 


He 


Tyr Val 


He 


He 




290 








295 










300 










Lys 


Ala 


Leu 


Val 


Thr He 


Pro 


Glu 


Thr 


Thr 


Phe 


Gin 


Thr 


Val 


Ser 


Trp 


305 








310 










315 










320 


His 


Phe 


Cys 


He 


Ala Leu 
325 


Gly 


Tyr 


Thr 


Asn 
330 


Ser 


Cys 


Leu 


Asn 


Pro 
335 


Val 


Leu 


Tyr 


Ala 


Phe 


Leu Asp 


Glu 


Asn 


Phe 


Lys 


Arg 


Cys 


Phe Arg Glu 


Phe 








340 








345 










350 






Cys 


He 


Pro 


Thr 


Ser Ser 


Asn 


He 


Glu 


Gin 


Gin 


Asn 


Ser Thr Arg 


He 






355 








360 










365 








Arg 


Gin 
370 


Asn 


Thr 


Arg Asp 


His 
375 


Pro 


Ser 


Thr 


Ala 


Asn 
380 


Thr 


Val 


Asp 


Arg 


Thr 


Asn 


His 


Gin 


Leu Glu 


Asn 


Leu 


Glu 


Ala 


Glu 


Thr 


Ala 


Pro 


Leu 


Pro 


385 








390 










395 










400 



<210> 11 
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<211> 1986 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note « 
synthetic construct 

<400> 11 

tcctcctctg ctcgcccaat ccttccaacc ccctatggtg gtatggctga cacagaaaat 60 
gtctgctcct gtatgggaca tttgcccctc ttctccaaat ataagacagg atgaggccta 120 
gcttttgctg ctccaaagtt ttaaaagaac acattgcacg gcatttaggg actctaaagg 180 
gtggaggagg aatgagggaa ttgcatcatg ccaaggctgg tec teat cca tcactgcttc 240 
cagggcccag agtggcttcc aggaggtatt cttacaaagg aagcccgatc tgtagctaac 300 
actcagagcc cattttcctg cgttaacccc tcccgacctc atatacagga gtaacatgat 360 
cagtgacctg ggggagctgg ccaaactgcg ggacctgccc aagctgaggg ccttggtgct 420 
gctggacaac ccctgtgccg atgagactga ctaccgccag gaggecctgg tgcagatggc 480 
acacctagag cgcctagaca aagagtacta tgaggacgag gaeegggcag aagctgagga 540 
gatccgacag aggctgaagg aggaacagga gcaagaactc gacccggacc aagacatgga 600 
accgtacctc ccgccaactt agtggctcct ctagcctgca gggacagtaa aggtgatggc 660 
aggaaggcag cccccggagg tcaaaggctg ggcacgcggg aggagaggee agagtcagag 720 
getgegggta tctcagatat gaaggaaaga tgagagaggc tcaggaagag gtaagaaaag 780 
acacaagaga ccagagaagg gagaagaatt agagagggag gcagaggacc gctgtctcta 840 
cagacatagc tggtagagac tgggaggaag ggatgaaccc tgagcgcatg aagggaagga 900 
ggtggctggt ggtatatgga ggatgtagct gggccaggga aaagatcctg cactaaaaat 960 
ctgaagctaa aaataacagg acaeggggtg gagaggegaa aggagggcag attgaggcag 1020 
agagactgag aggcctgggg atgtgggcat teeggtaggg cacacagttc acttgtcttc 1080 
tctttttcca ggaggecara gatgetgace tcaagaactc ataatacccc agtggggacc 1140 
accgcattca tagccctgtt acaagaagtg ggagatgttc ctttttgtcc cagactggaa 1200 
atccattaca tcccgaggct caggttctgt ggtggtcatc tctgtgtggc ttgttctgtg 1260 
ggcctaccta aagtcctaag cacagctctc aagcagatcc gaggegacta agatgetagt 1320 
aggggttgtc tggagagaag agecgaggag gtgggctgtg atggatcagt tcagctttca 1380 
aataaaaagg cgtttttata ttctgtgtcg agttcgtgaa cccctgtggt gggcttctcc 1440 
atctgtctgg gttagtacct gccactatac tggaataagg ggacgcctgc ttccctcgag 1500 
ttggctggac aaggttatga gcatccgtgt acttatgggg ttgecagett ggtcctggat 1560 
cgcccgggcc cttcccccac ccgttcggtt ccccaccacc acccgcgctc gtacgtgcgt 1620 
ctccgcctgc agctcttgac teategggge ccccgggtca catgcgctcg ctcggctcta 1680 
taggcgccgc cccctgccca ccccccgccc gcgctgggag ccgcagccgc cgccactcct 1740 
gctctctctg cgccgccgcc gtcaccaccg ccaccgccac eggctgagtc tgcagtcctc 1800 
gaggaactga aaaaccagaa agttaactgg taagtttagt ctttttgtct tttatttcag 1860 
gtcccggatc cggtggtggt gcaaatcaaa gaactgctcc tcagtggatg ttgectttae 1920 
ttctaggect gtacggaagt gttacttctg ctctaaaagc tgcggaattg tacccgcggc 1980 
caagct 1986 

<210> 12 
<211> 5982 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 12 

atgcggtttt ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca 60 
agtctccacc ecattgaegt caatgggagt ttgttttggc accaaaatca aegggacttt 120 
ccaaaatgtc gtaacaactc cgccccattg aegcaaatgg gcggtaggcg tgtacggtgg 180 
gaggtctata taagcagagc tctgtgaaac ttcgaggagt ctctttgttg aggacttttg 240 
agttctccct tgaggctccc acagatacaa taaatatttg agattgaacc ctgtcgagta 300 
tctgtgtaat cttttttacc tgtgaggtct eggaatcegg gecgagaact tcgcagttgg 360 
cgcccgaaca gggacttgat tgagagtgat tgaggaagtg aagctagagc aatagaaagc 420 
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tgttaagcag aactcctgct gacctaaata 
agtatctcta gtgaagcgga ctcgagctca 
aattacatct ggtgactctt cgcggacctt 
acaaggtagg agagattcta cagcaacatg 
gccattaaga gatgtagtaa tgttgctgta 
gaagggaatt tcagatgggc cattagaatg 
gatataccag agactttaga tcaactaagg 
gaaaaatttg gatctagcaa agaaattgat 
gtagcaggac ttttaaatat gacgggtgtc 
tcaaatggga ttagacacta ggccatctat 
tccacaggca tatcctattc aaacagtaaa 
aaaaatggtg tccattttta tggaaaaggc 
actatggttt actgccttct ctgcaaattt 
ggccgcacca gggtgcgctg cagataaaga 
agcagaatat gatcgcacac atccccctga 
agcagaaatt atgggtatag gattaactca 
agctaggatg cagtgtagag catggtatct 
agctaagtct cctcgagctg tgcagttaag 
tatagacaga ttgtttgccc aaatagatca 
tttaaaacag tcattgagca tagctaatgc 
ccttaagcca gaaagtaccc tagaagaaaa 
aggatataaa atgcaactct tggcagaagc 
aggatcagga ccagtgtgtt ttaattgtaa 
agaagtgaaa aaatgtaata aatgtggaaa 
aggaaataga aagaattgta caagggaaga 
attgggtagg atggatagga aatattccac 
tgggaatagg attaggagtg ttattattga 
taagaaattg tatccacaag atactaggat 
gagaagaaat acaaccacaa atggaattga 
aaaaagagga ggaatgatga agtatctcag 
gttcttccct ttgaggaagg tatgtcatat 
ctgatcagcc tcgactgtgc cttctagttg 
gccttccttg accctggaag gtgccactcc 
tgcatcgcat tgtctgagta ggtgtcattc 
caagggggag gattgggaag acaatagcag 
gactgggatg agtattggaa ccctgaagaa 
tttacgaaca aatgataaaa ggaaatagct 
ctgcctaacc gcaaaaccac atcctatgga 
ttgtaagagt atataaccag tgctttgtga 
ttgagttctc ccttgaggct cccacagata 
gtatctgtgt aatctttttt acctgtgagg 
ggccgctcat gaccgaccaa gcgacgccca 
ccgccttcta tgaaaggttg ggcttcggaa 
tccagcgcgg ggatctcatg ctggagttct 
ataatggtta caaataaagc aatagcatca 
tgcattctag ttgtggtttg tccaaactca 
cgacctcgag agcttggcgt aatcatggtc 
gctcacaatt ccacacaaca tacgagccgg 
atgagtgagc taactcacat taattgcgtt 
cctgtcgtgc cagctgcatt aatgaatcgg 
tgggcgctct tccgcttcct cgctcactga 
agcggtatca gctcactcaa aggcggtaat 
aggaaagaac atgtgagcaa aaggccagca 
gctggcgttt ttccataggc tccgcccccc 
tcagaggtgg cgaaacccga caggactata 
cctcgtgcgc tctcctgttc cgaccctgcc 
ttcgggaagc gtggcgcttt ctcaatgctc 
cgttcgctcc aagctgggct gtgtgcacga 
atccggtaac tatcgtcttg agtccaaccc 
agccactggt aacaggatta gcagagcgag 
gtggtggcct aactacggct acactagaag 



gggaagcagt agcagacgct gctaacagtg 480 
taatcaagtc attgtttaaa ggcccagata 540 
caagccagga gattcgccga gggacagtca 600 
gggaatggac aggggcgaga ttggaaaatg 660 
ggagtagggg ggaagagtaa aaaatttgga 720 
gctaatgtat ctacaggacg agaacctggt 780 
ttggttattt gcgatttaca agaaagaaga 840 
atggcaattg tgacattaaa agtctttgcg 900 
tactgctgct gcagctgaaa atatgtattc 960 
gaaagaagca ggtggaaaag aggaaggccc 1020 
tggagtacca caatatgtag cacttgaccc 1080 
aagagaagga ctaggaggtg aggaagttca 1140 
aacacctact gacatggcca cattaataat 1200 
aatattggat gaaagcttaa agcaactgac 1260 
tgctcccaga ccattaccct attttactgc 1320 
agaacaacaa gcagaagcaa gatttgcacc 1380 
cgaggcatta ggaaaattgg ctgccataaa 1440 
acaaggagct aaggaagatt attcatcctt 1500 
agaacaaaat acagctgaag ttaagttata 1560 
taatgcagac tgtaaaaagg caatgagcca 1620 
gttgagagct tgtcaagaaa taggctcacc 1680 
tcttacaaaa gttcaagtag tgcaatcaaa 1740 
aaaaccagga catctagcaa gacaatgtag 1800 
acctggtcat gtagctgcca aatgttggca 1860 
aagggataca acaattacaa aagtgggaag 192 0 
aatatttaaa gggactattg ggaggtatct 1980 
ttttatgttt acctacattg gttgattgta 2040 
acacagtaat tgcaatgcct gaagtagaag 2100 
ggagaaatgg taggcaatgt ggcatgtctg 2160 
acttatttta taagggagat actgtgctga 2220 
gaatccattt cgaaatcaaa ttagagctcg 2280 
ccagccatct gttgtttgcc cctcccccgt 2340 
cactgtcctt tcctaataaa atgaggaaat 2400 
tattctgggg ggtggggtgg ggcaggacag 2460 
gcatgctggg gtaaaaaaga aaaaagggtg 2520 
atagaaagaa tgcttatgga ctagggactg 2580 
gagcatgact catagttaaa gcgctagcag 2640 
aagcttgcta atgacgtata agttgttcca 2700 
aacttcgagg agtctctttg ttgaggactt 2760 
caataaatat ttgagattga accctgtcga 2820 
tctcggaatc cgggccgaga acttcgcagc 2 880 
acctgccatc acgagatttc gattccaccg 2940 
tcgttttccg ggacgccggc tggatgatcc 3000 
tcgcccaccc caacttgttt attgcagctt 3060 
caaatttcac aaataaagca tttttttcac 3120 
tcaatgtatc ttatcatgtc tggatcccgt 3180 
atagctgttt cctgtgtgaa attgttatcc 3240 
aagcataaag tgtaaagcct ggggtgccta 3300 
gcgctcactg cccgctttcc agtcgggaaa 3360 
ccaacgcgcg gggagaggcg gtttgcgtat 3420 
ctcgctgcgc tcggtcgttc ggctgcggcg 3480 
acggttatcc acagaatcag gggataacgc 3540 
aaaggccagg aaccgtaaaa aggccgcgtt 3600 
tgacgagcat cacaaaaatc gacgctcaag 3660 
aagataccag gcgtttcccc ctggaagctc 3720 
gcttaccgga tacctgtccg cctttctccc 3780 
acgctgtagg tatctcagtt cggtgtaggt 3840 
accccccgtt cagcccgacc gctgcgcctt 3900 
ggtaagacac gacttatcgc cactggcagc 3960 
gtatgtaggc ggtgctacag agttcttgaa 4020 
gacagtattt ggtatctgcg ctctgctgaa 4080 
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gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg 4140 
tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga 4200 
agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg 4260 
gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg 4320 
aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt 4380 
aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact 4440 
ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat 4500 
gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg 4560 
aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg 4620 
ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat 4680 
tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc 4740 
ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt 4800 
cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc 4860 
agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga 4920 
gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc 4980 
gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa 5040 
acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta 5100 
acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg 5160 
agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg 5220 
aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat 5280 
gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt 5340 
tccccgaaaa gtgccacctg acgtcgacgg atcgggagat ctcccgatcc cctatggtcg 5400 
actctcagta caatctgctc tgatgccgca tagttaagcc agtatctgct ccctgcttgt 5460 
gtgttggagg tcgctgagta gtgcgcgagc aaaatttaag ctacaacaag gcaaggcttg 5520 
accgacaatt gcatgaagaa tctgcttagg gttaggcgtt ttgcgctgct tcgcgatgta 5580 
cgggccagat atacgcgttg acattgatta ttgactagtt attaatagta atcaattacg 5640 
gggtcattag ttcatagccc atatatggag ttccgcgtta cataacttac ggtaaatggc 5700 
ccgcctggct gaccgcccaa cgacccccgc ccattgacgt caataatgac gtatgttccc 5760 
atagtaacgc caatagggac tttccattga cgtcaatggg tggactattt acggtaaact 5820 
gcccacttgg cagtacatca agtgtatcat atgccaagta cgccccctat tgacgtcaat 5880 
gacggtaaat ggcccgcctg gcattatgcc cagtacatga ccttatggga ctttcctact 5940 
tggcagtaca tctacgtatt agtcatcgct attaccatgg tg 5982 



<210> 13 
<211> 13361 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 13 

tggcagtaca tcaagtgtat catatgccaa gtacgccccc tattgacgtc aatgacggta 60 
aatggcccgc ctggcattat gcccagtaca tgaccttatg ggactttcct acttggcagt 120 
acatctacgt attagtcatc gctattacca tggtgatgcg gttttggcag tacatcaatg 180 
ggcgtggata gcggtttgac tcacggggat ttccaagtct ccaccccatt gacgtcaatg 240 
ggagtttgtt ttggcaccaa aatcaacggg actttccaaa atgtcgtaac aactccgccc 300 
cattgacgca aatgggcggt aggcgtgtac ggtgggaggt ctatataagc agagctctct 360 
ggctaactag agaacccact gcttaactgg cttatcgaaa ttaatacgac tcactatagg 420 
gagacccaag cttggtaccg agctcggatc cactagtaac ggccgccagt gtgctggaat 480 
tctgcagata tccatcacac tggcggccca taatcaagtc attgtttaaa ggcccagata 540 
aattacatct ggtgactctt cgcggacctt caagccagga gattcgccga gggacagtca 600 
acaaggtagg agagattcta cagcaacatg gggaatggac aggggcgaga ttggaaaatg 660 
gccattaaga gatgtagtaa tgttgctgta ggagtagggg ggaagagtaa aaaatttgga 720 
gaagggaatt tcagatgggc cattagaatg gctaatgtat ctacaggacg agaacctggt 780 
gatataccag agactttaga tcaactaagg ttggttattt gcgatttaca agaaagaaga 840 
gaaaaatttg gatctagcaa agaaattgat atggcaattg tgacattaaa agtctttgcg 900 
gtagcaggac ttttaaatat gacggtgtct actgctgctg cagctgaaaa tatgtattct 960 
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caaatgggat tagacactag gccatctatg 
ccacaggcat atcctattca aacagtaaat 
aaaatggtgt ccatttttat ggaaaaggca 
ctatggttta ctgccttctc tgcaaattta 
gccgcaccag ggtgcgctgc agataaagaa 
gcagaatatg atcgcacaca tccccctgat 
gcagaaatta tgggtatagg attaactcaa 
gctaggatgc agtgtagagc atggtatctc 
gctaagtctc ctcgagctgt gcagttaaga 
atagacagat tgtttgccca aatagatcaa 
ttaaaacagt cattgagcat agctaatgct 
cttaagccag aaagtaccct agaagaaaag 
ggatataaaa tgcaactctt ggcagaagct 
ggatcaggac cagtgtgttt taattgtaaa 
gaagtgaaaa aatgtaataa atgtggaaaa 
ggaaatagaa agaattcggg aaactggaag 
atgcagcaag cagtaatgcc atctgcacct 
attataataa agtaggtact actacaacat 
taaatggata tcctataaaa tttttattag 
ggagagattt tcaagtaaaa aattctatag 
gaggaggaaa gagaggaaca aattatatta 
ataagacaca atgtatattt ggtaatgttt 
cattattagg gagagataat atgattaaat 
ctgataagat tccagtagta aaagtaaaaa 
aacaatggcc attaacaaat gaaaaaattg 
aaagagaagg gaaagtaaaa agagcagatc 
'ctataaaaaa gaaaagtgga aaatggagaa 
taactgagaa aggagcagag gtccagttgg 
aaaaacaagt aacagtatta gatatagggg, 
attatgctcc ttatacagca tttactttac 
gatttgtgtg gtgtagtcta ccacaaggct 
cattagataa tataatacaa ccttttatta 
atatggatga catttatata ggatcaaatt 
aagaattaag aaaattacta ttatggtggg 
aagaaccccc atatacatgg atgggttatg 
agaaacagtt agacattcca gaacagccca 
aaattaattg ggctagccaa gctattccag 
tgagaggaaa tcaaaaccta aattcaacaa 
tacaaaaggc aaaaaaggct atagaagaac 
aggagttata tgctaaatta agtttggtgg 
agaaggatcc agaaaagata ctatggtatg 
aaaatacatg tgatatagcc ttaagagcat 
gaataggaaa agaaccaaga tatgaaatac 
taattaattc accatatctt aaggccccac 
tgaatataaa gagagcgtta agtatgataa 
ggtatataga tggaggtaga aagctaggaa 
caggaaagtg gcaagtgatg gaattagaag 
tattattggc attaaaagca ggatcagagg 
ttataaatat tattcttcaa caaccagata 
aagaattgga gaagaaaaca gcaatattta 
caggaaatga ggaagtagat aagctttgtc 
tattagataa aaggtcagaa gatgcaggat 
tattgccagg agaggtaaaa gtaataccaa 
attggggatt aataatagga aaaagctcga 
gggtaataga cgaaggatat cgaggtgaaa 
aatcaatcac cttaatggaa cgacaaaaga 
atgaagtatt agaacaagga aaagtagtaa 
ggtcaacagg agtattctcc tcttgggttg 
aaaaatttca ctcagatcca cagtacttaa 
cagaagagat aagacgaaaa tgcccagtat 
aattgaaaat agggcctggt atctggcaaa 



aaagaagcag gtggaaaaga ggaaggccct 1020 
ggagtaccac aatatgtagc acttgaccca 1080 
agagaaggac taggaggtga ggaagttcaa 1140 
acacctactg acatggccac attaataatg 1200 
atattggatg aaagcttaaa gcaactgaca 1260 
gctcccagac cattacccta ttttactgca 1320 
gaacaacaag cagaagcaag atttgcacca 1380 
gaggcattag gaaaattggc tgccataaaa 1440 
caaggagcta aggaagatta ttcatccttt 1500 
gaacaaaata cagctgaagt taagttatat 1560 
aatgcagact gtaaaaaggc aatgagccac 1620 
ttgagagctt gtcaagaaat aggctcacca 1680 
cttacaaaag ttcaagtagt gcaatcaaaa 1740 
aaaccaggac atctagcaag acaatgtaga 1800 
cctggtcatg tagctgccaa atgttggcaa 1860 
gcggggcgag ctgcagcccc agtgaatcaa 1920 
ccaatggagg agaaactatt ggatttataa 1980 
tagaaaagag gccagaaata ctcatatttg 2040 
acacaggagc agatataaca attttaaata 2100 
aaaatggaag gcaaaatatg attggagtag 2160 
atgtacattt agagattaga gatgaaaatt 2220 
gtgtcttaga agataactca ttaatacaac 2280 
tcaatattag gttagtaatg gctcaaattt 2340 
tgaaggatcc taataaagga cctcaaataa 2400 
aagccttaac agaaatagta gaaagactag 2460 
caaataatcc atggaataca ccagtatttg 2520 
tgctcataga ttttagagaa ttaaacaaac 2580 
gactacctca tcctgctggt ttacaaataa 2640 
atgcatattt caccattcct cttgatccag 2700 
ctagaaaaaa taatgcggga ccaggaagga 2760 
ggattttaag tccattgata tatcaaagta 2820 
gacaaaatcc tcaattagat atttaccaat 2880 
taagtaaaaa ggagcataaa gaaaaggtag 2940 
gatttgaaac tccagaagat aaattacagg 3000 
aattacatcc attaacatgg acaatacaac 3060 
ctctaaatga gttgcaaaaa ttagcaggaa 3120 
acttgagtat aaaagcatta actaacatga 3180 
gacaatggac taaagaagct cgactggaag 3240 
aagtacaact aggatactat gaccccagta 3300 
gaccacatca aataagttat caagtatatc 3360 
gaaaaatgag tagacaaaag aaaaaggcag 3420 
gctataagat aagagaagag tctattataa 3480 
ctacttctag agaagcctgg gaatcaaatt 3540 
ctcctgaggt agaatatatc catgctgctt 3600 
aagatgctcc aataccagga gcagaaacat 3660 
aagcagcaaa agcagcctat tggacagata 3720 
gcagtaatca gaaggcagaa atacaagcat 3780 
agatgaatat tataacagat tcacaatatg 3840 
tgatggaggg aatctggcaa gaagttttag 3900 
tagattgggt cccaggacat aaaggtattc 3960 
aaacaatgat gataatagaa ggggatggga 4020 
atgatttatt agctgcaaaa gaaatacatt 4080 
caggggtaaa gctaatgttg cctaaaggat 4140 
tagggagtaa aggattggat gtattaggag 4200 
ttggagtaat aatgattaat gtatcaagaa 4260 
tagcacaatt aataatattg ccttgtaaac 4320 
tggattcaga gagaggagac aatggttatg 4380 
acagaattga ggaagcagaa ataaatcatg 4440 
ggactgaatt taatttacct aaaatggtag 4500 
gcagaatcag aggagaacaa gtgggaggac 4560 
tggattgcac acactttgat ggcaaaataa 4620 
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ttcttgtggg tatacatgtg gaatcaggat 
ctgctgactg tacagttaaa gctgtcttac 
tacaaacaga taatggacca aattttaaaa 
tgggtgtgaa acataagttt ggtatcccag 
atgtaaatca tacattaaaa gtttggattc 
ataatgcctt atctctcgct gtacatagtc 
ggatggcccc ttatgaatta ttagcacaac 
ctgcaatacc acaaaaattg caagcacagt 
aatggaaagg accaatgaga gtagaatact 
aagagaaggg atattttctt atacctagga 
ctcttcctga aggggatgag tgaagaagat 
ctccaaggag gagtaaatag cgctatgcta 
gaaaagtata aaaaagactt caagaaaaga 
agactacgga aagctgaagg aataaaatgg 
tatgtcagag aaatggtggc aggatccact 
ataagtaacc cactatggca ttctcagtat 
tggccttttg taaatatgtg gataaaaaca 
aatatttgta taggaggaga agtttcacca 
ataaaagctt ttagttgtgg cgaaagaaag 
ggagaaatag atccaaaaaa atggtgcgga 
tcacctccaa agactttaca aagactcgct 
tggcgaggat gctgtaatca acgctttgtt 
gtcattcaat ccaagcccag ctggaacctg 
ataatagtat tattcaatag ggtcactgag 
tttgtattag cacatcaatt agaaagggac 
tggagatata gatttaagaa accccgagta 
ttctattatt ggcagttgca atctacatta 
taatatttca tttgcaacaa taagaatggc 
aggactagaa gaagctgaag agttattaga 
aggaccacta aatccaggag taaacccatt 
gcaaaactac tgtaacatat tacaacctaa 
ggtaaaactg gaagaaggaa atgcaggtaa 
tgatgaaagt gtattgtccc tggttcatgc 
tcgaaataag ttaggatctt taagacatga 
ttataataat agagagaagg gtacaactga 
aggaacggtg actttgtacc tgattttatt 
caacgctcag gtagtatgga gacttccacc 
aattttttgg gattgttggg caccagaaga 
gatacatcta aaagctaaga caaatataag 
ggctagagaa atatgggcaa cattattcaa 
aatatggaaa agatggaatg agactataac 
ttataatgtt tcagtaatag tacctgatta 
gttacaaggg aaaataaata tatcattatg 
agttacaaaa caattaagct attgtacaga 
atttggacct aatcaaacat gtatgtggaa 
aaggccgcgg ccctggcaac ccatcaagaa 
ataaacaact taagattagt tacattagag 
gaagctatgg aaaaattttt gtatacagct 
aatcaatttt tctgcaaaat ccctcctgag 
caaacaatat ggaatcatgg aaatataact 
ttacaacaaa agttttatga aataataatg 
aaagggatac aacaattaca aaagtgggaa 
caatatttaa agggactatt gggaggtatc 
attttatgtt tacctacatt ggttgattgt 
tacacagtaa ttgcaatgcc tgaagtagaa 
aggagaaatg gtaggcaatg tggcatgtct 
gacttatttt ataagggaga tactgtgctg 
tgaatccatt tcgaatcaaa tcaaactaat 
acaaagaaga agaagaaaga agaaagcctt 
cttcaggaag ctatttggca cgacttctac 
tgaacctcct aaaaaagaaa aaagggtgga 



atatatgggc acaaataatt tctcaagaaa 4680 
aattgttgag tgctcataat gttactgaat 4740 
atcaaaagat ggaaggagta ctcaattaca 4800 
ggaacccaca gtcacaagca ttagttgaaa 4860 
ggaaattttt gcctgaaaca acctccttgg 4920 
tcaattttaa aagaagaggt aggataggag 4980 
aagaatcctt aagaa'tacaa gattattttt 5040 
ggatttatta taaagatcaa aaagataaga 5100 
ggsgacaggg atcagtatta ttaaaggatg 5160 
gacacataag gagagttcca gaaccctgcg 5220 
tggcaggtaa gtagaagact ctttgcagtg 5280 
tacatatcta ggctacctcc ggatgaaaga 5340 
ctttttgaca cagaaacagg atttataaag 5400 
agctttcata ctagagatta ttacatagga 5460 
acatcattaa gtctaaggat gtatatatat 5520 
cgtccaggtt tgaaaaattt caataaggaa 5580 
ggatttatgt gggatgatat tgaaaaacaa 5640 
ggatggggac cagggatggt aggtatagca 5700 
attgaggcta ctcctgtaat gattataaga 5760 
gattgttgga atttaatgtg tcttagaaac 5820 
atgttggcgt gtggcgtgcc ggctaagaag 5880 
tctccttaca gaacgcctgc tgatttagag 5940 
ttatggtcgg gagaattatg aatggaagac 6000 
aaactagaaa aagaattagc tatcagaata 6060 
aaagctatta gattactaca aggattattt 6120 
gattattgtt tatgttggtg gtgttgcaaa 6180 
tcaataacta ctgcttagaa atatttagat 6240 
agaaggattt gcagccaata gacaatggat 6300 
ttttgatata gcaacacaaa tgagtgaaga 6360 
tagggtacct ggaataacag aaaaagaaaa 642 0 
gttacaagat ctaaggaacg aaattcaaga 6480 
gtttagaaga gcaagatttt taaggtattc 6540 
gttcatagga tattgtatat atttaggtaa 6600 
cattgatata gaagcacccc aagaagagtg 6660 
caatataaaa tatggtagac gatgttgcct 6720 
tataggaata ataatatatt cacagacaac 6780 
attagtagtc ccagtagaag aatcagaaat 6840 
acccgcctgt caggactttc ttggggcaat 6900 
tatacgagag ggacctacct tggggaattg 6960 
aaaggctact agacaatgta gaagaggcag 7020 
aggaccatca ggatgtgcta ataacacatg 7080 
tcagtgttat ttagatagag tagatacttg 714 0 
tctaacagga ggaaaaatgt tgtacaataa 7200 
cccattacaa atcccactga tcaattatac 7260 
tacttcacaa attcaggacc ctgaaatacc 7320 
gctgtagaaa aggtgactga agccttaaag 7380 
catcaagtac tagtaatagg attaaaagta 7440 
ttcgctatgc aagaattagg atgtaatcaa 7500 
ttgtggacaa ggtataatat gactataaat 7560 
ttgggggaat ggtataacca aacaaaagat 7620 
gacatagaac aaaataatgt acaagggaag 7680 
gattgggtag gatggatagg aaatattcca 7740 
ttgggaatag gattaggagt gttattattg 7800 
ataagaaatt gtatccacaa gatactagga 7860 
ggagaagaaa tacaaccaca aatggaattg 7920 
gaaaaagagg aggaatgatg aagtatctca 7980 
agttcttccc tttgaggaag gtatgtcata 8040 
aaagtatgta ttgtaaggta aaaggaaaag 8100 
caagaggatg atgacagagt tagaagatcg 8160 
aacgggagac agcacagtag attctgaaga 8220 
ctgggatgag tattggaacc ctgaagaaat 8280 
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agaaagaatg cttatggact agggactgtt 
ctagagggcc ctattctata gtgtcaccta 
gtgccttcta gttgccagcc atctgttgtt 
gaaggtgcca ctcccactgt cctttcctaa 
agtaggtgtc attctattct ggggggtggg 
gaagacaata gcaggcatgc tggggatgcg 
accagctggg gctcgagggg ggatccccac 
ggtgtggtgg ttacgcgcag cgtgaccgct 
ttcgctttct tcccttcctt tctcgccacg 
cggggcatcc ctttagggtt ccgatttagt 
gattagggtg atggttcacg tagtgggcca 
acgttggagt ccacgttctt taatagtgga 
cctatctcgg tctattcttt tgatttataa 
aaaaatgagc tgatttaaca aaaatttaac 
atttaaatat ttgcttatac aatcttcctg 
gtgggtaccg agctcgaatt ctgtggaatg 
gctccccagg caggcagaag tatgcaaagc 
ggaaagtccc caggctcccc agcaggcaga 
gcaaccatag tcccgcccct aactccgccc 
cattctccgc cccatggctg actaattttt 
gcctctgagc tattccagaa gtagtgagga 
aagctcbcgg gagcttggat atccattttc 
tttcgcatga ttgaacaaga tggattgcac 
ctattcggct atgactgggc acaacagaca 
ctgtcagcgc aggggcgccc ggttcttttt 
gaactgcagg acgaggcagc gcggctatcg 
gctgtgctcg. acgttgtcac tgaagcggga 
gggcaggatc tcctgtcatc tcaccttgct 
gcaatgcggc ggctgcatac gcttgatccg 
catcgcatcg agcgagcacg tactcggatg 
gacgaagagc atcaggggct cgcgccagcc 
jcccgacggcg aggatctcgt cgtgacccat 
gaaaatggcc gcttttctgg attcatcgac 
caggacatag cgttggctac ccgtgatatt 
cgcttcctcg tgctttacgg tatcgccgct 
cttcttgacg agttcttctg agcgggactc 
ccaacctgcc atcacgagat ttcgattcca 
gaatcgtttt ccgggacgcc ggctggatga 
tcttcgccca ccccaacttg tttattgcag 
tcacaaattt cacaaataaa gcattttttt 
tcatcaatgt atcttatcat gtctggatcc 
gtcatagctg tttcctgtgt gaaattgtta 
cggaagcata aagtgtaaag cctggggtgc 
gttgcgctca ctgcccgctt tccagtcggg 
cggccaacgc gcggggagag gcggtttgcg 
tgactcgctg cgctcggtcg ttcggctgcg 
aatacggtta tccacagaat caggggataa 
gcaaaaggcc aggaaccgta aaaaggccgc 
ccctgacgag catcacaaaa atcgacgctc 
ataaagatac caggcgtttc cccctggaag 
gccgcttacc ggatacctgt ccgcctttct 
ctcacgctgt aggtatctca gttcggtgta 
cgaacccccc gttcagcccg accgctgcgc 
cccggtaaga cacgacttat cgccactggc 
gaggtatgta ggcggtgcta cagagttctt 
aaggacagta tttggtatct gcgctctgct 
tagctcttga tccggcaaac aaaccaccgc 
gcagattacg cgcagaaaaa aaggatctca 
tgacgctcag tggaacgaaa actcacgtta 
gatcttcacc tagatccttt taaattaaaa 
tgagtaaact tggtctgaca gttaccaatg 



tacgaacaaa tgataaaagg aaatagctga 8340 
aatgctagag ctcgctgatc agcctcgact 8400 
tgcccctccc ccgtgccttc cttgaccctg 8460 
taaaatgagg aaattgcatc gcattgtctg 8520 
Qtggggcagg acagcaaggg ggaggattgg 8580 
gtgggctcta tggcttctga ggcggaaaga 8640 
gcgccctgta gcggcgcatt aagcgcggcg 8700 
acacttgcca gcgccctagc gcccgctcct 8760 
ttcgccggct ttccccgtca agctctaaat 8820 
gctttacggc acctcgaccc caaaaaactt 8880 
tcgccctgat agacggtttt tcgccctttg 8940 
ctcttgttcc aaactggaac aacactcaac 9000 
gggattttgg ggatttcggc ctattggtta 9060 
gcgaatttta acaaaatatt aacgtttaca 9120 
tttttggggc ttttctgatt atcaaccggg 9180 
tgtgtcagtt agggtgtgga aagtccccag 9240 
atgcatctca attagtcagc aaccaggtgt 9300 
agtatgcaaa gcatgcatct caafctagtca 9360 
atcccgcccc taactccgcc cagttccgcc 9420 
tttatttatg cagaggccga ggccgcctcg 9480 
ggcttttttg gaggcctagg cttttgcaaa 9540 
ggatctgatc aagagacagg atgaggatcg 9600 
gcaggttctc cggccgcttg ggtggagagg 9660 
atcggctgct ctgatgccgc cgtgttccgg 9720 
gtcaagaccg acctgtccgg tgccctgaat 9780 
tggctggcca cgacgggcgt tccttgcgca 9840 
agggactggc tgctattggg cgaagtgccg 9900 
cctgccgaga aagtatccat catggctgat 9960 
gctacctgcc cattcgacca ccaagcgaaa 10020 
gaagccggtc ttgtcgatca ggatgatctg 10080 
gaactgttcg ccaggctcaa ggcgcgcatg 10140 
ggcgatgcct gcttgccgaa tatcatggtg 10200 
tgtggccggc tgggtgtggc ggaccgctat 10260 
gctgaagagc ttggcggcga atgggctgac 10320 
cccgattcgc agcgcatcgc cttctatcgc 10380 
tggggttcga aatgaccgac caagcgacgc 1044 0 
ccgccgcctt ctatgaaagg ttgggcttcg 10500 
tcctccagcg cggggatctc atgctggagt 10560 
cttataatgg ttacaaataa agcaatagca 10620 
cactgcattc tagttgtggt ttgtccaaac 10680 
cgtcgacctc gagagcttgg cgtaatcatg 10740 
tccgctcaca attccacaca acatacgagc 10800 
ctaatgagtg agctaactca cattaattgc 10860 
aaacctgtcg tgccagctgc attaatgaat 10920 
tattgggcgc tcttccgctt cctcgctcac 10980 
gcgagcggta tcagctcact caaaggcggt 11040 
cgcaggaaag aacatgtgag caaaaggcca 11100 
gttgctggcg tttttccata ggctccgccc 11160 
aagtcagagg tggcgaaacc cgacaggact 11220 
ctccctcgtg cgctctcctg ttccgaccct 11280 
cccttcggga agcgtggcgc tttctcaatg 11340 
ggtcgttcgc tccaagctgg gctgtgtgca 11400 
cttatccggt aactatcgtc ttgagtccaa 11460 
agcagccact ggtaacagga ttagcagagc 11520 
gaagtggtgg cctaactacg gctacactag 11580 
gaagccagtt accttcggaa aaagagttgg 11640 
tggtagcggt ggtttttttg tttgcaagca 11700 
agaagatcct ttgatctttt ctacggggtc 11760 
agggattttg gtcatgagat tatcaaaaag 11820 
atgaagtttt aaatcaatct aaagtatata 11880 
cttaatcagt gaggcaccta tctcagcgat 11940 
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ctgtctattt cgttcatcca tagttgcctg actccccgtc gtgtagataa ctacgatacg 12000 
ggagggctta ccatctggcc ccagtgctgc aatgataccg cgagacccac gctcaccggc 12060 
tccagattta tcagcaataa accagccagc cggaagggcc gagcgcagaa gtggtcctgc 12120 
aactttatcc gcctccatcc agtctattaa ttgttgccgg gaagctagag taagtagttc 12180 
gccagttaat agtttgcgca acgttgttgc cattgctaca ggcatcgtgg tgtcacgctc 12240 
gtcgtttggt atggcttcat tcagctccgg ttcccaacga tcaaggcgag ttacatgatc 12300 
ccccatgttg tgcaaaaaag cggttagctc cttcggtcct ccgatcgttg tcagaagtaa 12360 
gttggccgca gtgttatcac tcatggttat ggcagcactg cataattctc ttactgtcat 12420 
gccatccgta agatgctttt ctgtgactgg tgagtactca accaagtcat tctgagaata 12480 
gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata cgggataata ccgcgccaca 12540 
tagcagaact ttaaaagtgc tcatcattgg aaaacgttct tcggggcgaa aactctcaag 12600 
gatcttaccg ctgttgagat ccagttcgat gtaacccact cgtgcaccca actgatcttc 12660 
agcatctttt actttcacca gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc 12720 
aaaaaaggga ataagggcga cacggaaatg ttgaatactc atactcttcc tttttcaata 12780 
ttattgaagc atttatcagg gttattgtct catgagcgga tacatatttg aatgtattta 12840 
gaaaaataaa caaatagggg ttccgcgcac atttccccga aaagtgccac c'tgacgtcga 12900 
cggatcggga gatctcccga tcccctatgg tcgactctca gtacaatctg ctctgatgcc 12960 
gcatagttaa gccagtatct gctccctgct tgtgtgttgg aggtcgctga gtagtgcgcg 13020 
agcaaaattt aagctacaac aaggcaaggc ttgaccgaca attgcatgaa gaatctgctt 13080 
agggttaggc gttttgcgct gcttcgcgat gtacgggcca gatatacgcg ttgacattga 13140 
ttattgacta gttattaata gtaatcaatt acggggtcat tagttcatag cccatatatg 13200 
gagttccgcg ttacataact tacggtaaat ggcccgcctg gctgaccgcc caaqgacccc 13260 
cgcccattga cgtcaataat gacgtatgtt cccatagtaa cgccaatagg gactttccat 13320 
tgacgtcaat gggtggacta tttacggtaa actgcccact c 13361 



<210> 14 
<211> 9569 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 14 

atgcggtttt ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca 60 
agtctccacc ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt 120 • 
ccaaaatgtc gtaacaactc cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg 180 
gaggtctata taagcagagc tctgtgaaac ttcgaggagt ctctttgttg aggacttttg 240 
agttctccct tgaggctccc acagatacaa taaatatttg agattgaacc ctgtcgagta 300 
tctgtgtaat cttttttacc tgtgaggtct cggaatccgg gccgagaact tcgcagttgg 360 
cgcccgaaca gggacttgat tgagagtgat tgaggaagtg aagctagagc aatagaaagc 420 
tgttaagcag aactcctgct gacctaaata gggaagcagt agcagacgct gctaacagtg 480 
agtatctcta gtgaagcgga ctcgagctca taatcaagtc attgtttaaa ggcccagata 540 
aattacatct ggtgactctt cgcggacctt caagccagga gattcgccga gggacagtca 600 
acaaggtagg agagattcta cagcaacatg gggaatggac aggggcgaga ttggaaaatg 660 
gccattaaga gatgtagtaa tgttgctgta ggagtagggg ggaagagtaa aaaatttgga 720 
gaagggaatt tcagatgggc cattagaatg gctaatgtat ctacaggacg agaacctggt 780 
gatataccag agactttaga tcaactaagg ttggttattt gcgatttaca agaaagaaga 840 
gaaaaatttg gatctagcaa agaaattgat atggcaattg tgacattaaa agtctttgcg 900 
gtagcaggac ttttaaatat gacgggtgtc tactgctgct gcagctgaaa atatgtattc 960 
tcaaatggga ttagacacta ggccatctat gaaagaagca ggtggaaaag aggaaggccc 1020 
tccacaggca tatcctattc aaacagtaaa tggagtacca caatatgtag cacttgaccc 1080 
aaaaatggtg tccattttta tggaaaaggc aagagaagga ctaggaggtg aggaagttca 1140 
actatggttt actgccttct ctgcaaattt aacacctact gacatggcca cattaataat 1200 
ggccgcacca gggtgcgctg cagataaaga aatattggat gaaagcttaa agcaactgac 1260 
agcagaatat gatcgcacac atccccctga tgctcccaga ccattaccct attttactgc 1320 
agcagaaatt atgggtatag gattaactca agaacaacaa gcagaagcaa gatttgcacc 1380 
agctaggatg cagtgtagag catggtatct cgaggcatta ggaaaattgg ctgccataaa 1440 
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agctaagtct cctcgagctg tgcagttaag 
tatagacaga ttgtttgccc aaatagatca 
tttaaaacag tcattgagca tagctaatgc 
ccttaagcca gaaagtaccc tagaagaaaa 
aggatataaa atgcaactct tggcagaagc 
aggatcagga ccagtgtgtt ttaattgtaa 
agaagtgaaa aaatgtaata aatgtggaaa 
aggaaataga aagaattgta caagggaaga 
attgggtagg atggatagga aatattccac 
tgggaatagg attaggagtg ttattattga 
taagaaattg tatccacaag atactaggat 
gagaagaaat acaaccacaa atggaattga 
aaaaagagga ggaatgatga agtatctcag 
gttcttccct ttgaggaagg tatgtcatat 
gcctgctatt gtcttcccaa tcctccccct 
tagaatgaca cctactcaga caatgcgatg 
tgggagtggc accttccagg gtcaaggaag 
ggcaactaga aggcacagtc gaggctgatc 
cttccagatt ttctagctga tgattagttc 
ctctagatgt tctgacgaat tcgagtggag 
atacagaact ctctgaagca tcgtttgaag 
ttgaggcagc tgtttgtgta aectagagca 
gtagtttctg ggattgtaac caaggcttta 
cagacgatga acacagccac caccaccagc 
tccttttctt tggagccaga gagcatgcgg 
tagcacacgg taatgatgag cactggcata 
agcaggtttt cccagtacca ggttggatga 
tgcctgtatt ttgttgtagc catgaacatt 
cagttgcaga cattgataat tttggcattt 
gggtggcaga ctgcaatgta tcgatcaaca 
aacatgttat agtaatctat ggagatcact 
gttcccatta ggtaattcac actctggaag 
agagcaaggt tgaaaatgta gatgttggtg 
acatacatga ccaggaagtt tccgaagagc 
atgatcgtga tggccgtgat catggaggga 
ccgcccaggt cggtgcggtt cggaccgcat 
aagttgaccc aggaaccggg gctgggtgct 
gtgcaattgc tggcgttcgt gggggcagcg 
gcaccgctgt gcgaggtagc caggagcacc 
gcttcctttc gccgctgcca cagctcctgc 
acagagagga gcgtcaggcg gaggggaccg 
tagccccctc ccaccttagt agttcacaga 
cttggccgcg ggtacaattc cgcagctttt 
agaagtaaag gcaacatcca ctgaggagca 
ggacctgaaa taaaagacaa aaagactaaa 
cctcgaggac tgcagactca gccggtggcg 
gagcaggagt ggcggcggct gcggctccca 
cctatagagc cgagcgagcg catgtgaccc 
ggagacgcac gtacgagcgc gggtggtggt 
ggcgatccag gaccaagctg gcaaccccat 
ccaactcgag ggaagcaggc gtccccttat 
agatggagaa gcccaccaca ggggttcacg 
tatttgaaag ctgaactgat ccatcacagc 
ccctactagc atcttagtcg cctcggatct 
ggcccacaga acaagccaca cagagatgac 
ggatttccag tctgggacaa aaaggaacat 
cggtggtccc cactggggta ttatgagttc 
aagagaagac aagtgaactg tgtgccctac 
ctctctgcct caatctgccc tcctttcgcc 
tcagattttt agtgcaggat cttttccctg 
cacctccttc ccttcatgcg ctcagggttc 



acaaggagct aaggaagatt attcatcctt 1500 
agaacaaaat acagctgaag ttaagttata 1560 
taatgcagac tgtaaaaagg caatgagcca 1620 
gttgagagct tgtcaagaaa taggctcacc 1680 
tcttacaaaa gttcaagtag tgcaatcaaa 1740 
aaaaccagga catctagcaa gacaatgtag 1800 
acctggtcat gtagctgcca aatgttggca 1860 
aagggataca acaattacaa aagtgggaag 1920 
aatatttaaa gggactattg ggaggtatct 1980 
ttttatgttt acctacattg gttgattgta 2040 
acacagtaat tgcaatgcct gaagtagaag 2100 
ggagaaatgg taggcaatgt ggcatgtctg 2160 
acttatttta taagggagat actgtgctga 2220 
gaatccattt cgaatcaaat tccccagcat 2280 
tgctgtcctg ccccacccca ccccccagaa 234 0 
caatttcctc attttattag gaaaggacag 2400 
gcacggggga ggggcaaaca acagatggct 2460 
agcgagctct gggcaacgga gcagtttctg 252 0 
tatccactgt attggccgtg gaggggtggt 2580 
ttttgttgct caatgttgga agaggttggg 2640 
ttttcatcca gaaatgeata aaggactggg 2700 
atgcagaagt gccaagaaac agtctggaac 2760 
atgatgacgt aaatgtgaat gggagtccag 2820 
accatcctgg tgatccttcg aagattcctg 2880 
acactcttga ggcgcaagat catcagtcca 2940 
atgaaggcga agatgaaaac acagatcttc 3000 
gagaatgtta gtgtacaatc tatggaacct 3060 
acaggaagac caatggctga agagaggatc 3120 
cggggagtac ggaaatctaa ggccttgaca 3180 
ctcatggtgc agagggtgaa tatgctggtg 3240 
atcttgcaaa ggatggttcc aaatggccat 3300 
ggcagggtac tggtggctaa ggcatctgcc 3360 
gcagtcttca tcttggtgta tctgacaatc 3420 
cccaccacgc acacgatgga gtagagggcc 3480- 
ctgccggtcg gagggcacag gctgtctctc 3540 
gggtcggaca ggttgccatc taagtgggac 3600. 
ggggagcaac ttgagtacgc caaggcatca 3660* 
ctgctgtcca tggtactgac ggccgggcgg 3720 
gagacttttc gggttccaag cgcctcagcc 3780 
tgtttcttac agaaaccagt cctggctgag 3840 
agctgagcat ctgacattct cctctgcgta 3900 
ggctcatccg gaattcgata tcaagcttag 3960 
agagcagaag taacacttcc gtacaggcct 4020 
gttctttgat ttgcaccacc accggatccg 4080 
cttaccagtt aactttctgg tttttcagtt 4140 
gtggcggtgg tgacggcggc ggcgcagaga 4200 
gcgcgggcgg ggggtgggca gggggcggcg 4260 
gggggccccg atgagtcaag agctgcaggc 4320 
ggggaaccga acgggtgggg gaagggcccg 4380 
aagtacacgg atgctcataa ccttgtccag 4440 
tccagtatag tggcaggtac taacccagac 4500 
aactcgacac agaatataaa aacgcctttt 4560 
ccacctcctc ggctcttctc tccagacaac 4620 
gcttgagagc tgtgcttagg actttaggta 4680 
caccacagaa cctgagcctc gggatgtaat 4740 
ctcccacttc ttgtaacagg gctatgaatg 4800 
ttgaggtcag catctytggc ctcctggaaa 4860 
cggaatgccc acatccccag gcctctcagt 4920 
tctccacccc gtgtcctgtt atttttagct 4980 
gcccagctac atcctccata taccaccagc 5040 
atcccttcct cccagtctct accagctatg 5100 
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tctgtagaga cagcggtcct ctgcctccct 
gtgtctfcttc ttacctcttc ctgagcctct 
cagcctctga ctctggcctc tcctcccgcg 
tcctgccatc acctttactg tccctgcagg 
cggttccatg tcttggtccg ggtcgagttc 
gatctcctca gcttctgccc ggtcctcgtc 
gtgtgccatc tgcaccaggg cctcctggcg 
cagcagcacc aaggccctca gcttgggcag 
actgatcatg ttactcctgt atatgaggtc 
gagtgttagc tacagatcgg gcttcctttg 
cctggaagca gtgatggatg aggaccagcc 
ccacccttta gagtccctaa atgccgtgca 
aagctaggcc tcatcctgt.c. ttatatttgg 
agacattttc tgtgtcagcc ataccaccat 
aggaaatgga attgaggaga aatggtaggc 
gatgaagtat ctcagactta ttttataagg 
gaaggtatgt catatgaatc catttcgaat 
tgggatgagt attggaaccc tgaagaaata 
acgaacaaat gataaaagga aatagctgag 
cctaaccgca aaaccacatc ctatggaaag 
taagagtata taaccagtgc tttgtgaaac 
agttctccct tgaggctccc acagatacaa 
tctgtgtaat cttttttacc tgtgaggtct 
cgaccaagcg acgcccaacc tgccatcacg 
aaggttgggc ttcggaatcg ttttccggga 
tctcatgctg gagttcttcg cccaccccaa 
ataaagcaat agcatcacaa atttcacaaa 
tggtttgtcc aaactcatca atgtatctta 
ttggcgtaat catggtcata gctgtttcct 
cacaacatac gagccggaag cataaagtgt 
ctcacattaa ttgcgttgcg ctcactgccc 
ctgcattaat gaatcggcca acgcgcgggg 
gcttcctcgc tcactgactc gctgcgctcg 
cactcaaagg cggtaatacg gttatccaca 
tgagcaaaag gccagcaaaa ggccaggaac 
cataggctcc gcccccctga cgagcatcac 
aacccgacag gactataaag ataccaggcg 
cctgttccga ccctgccgct taccggatac 
gcgctttctc aatgctcacg ctgtaggtat 
ctgggctgtg tgcacgaacc ccccgttcag 
cgtcttgagt ccaacccggt aagacacgac 
aggattagca gagcgaggta tgtaggcggt 
tacggctaca ctagaaggac agtatttggt 
ggaaaaagag ttggtagptc ttgatccggc 
tttgtttgca agcagcagat tacgcgcaga 
ttttctacgg ggtctgacgc tcagtggaac 
agattatcaa aaaggatctt cacctagatc 
atctaaagta tatatgagta aacttggtct 
cctatctcag cgatctgtct atttcgttca 
ataactacga tacgggaggg cttaccatct 
ccacgctcac cggctccaga tttatcagca 
agaagtggtc ctgcaacttt atccgcctcc 
agagtaagta gttcgccagt taatagtttg 
gtggtgtcac gctcgtcgtt tggtatggct 
cgagttacat gatcccccat gttgtgcaaa 
gttgtcagaa gtaagttggc cgcagtgtta 
tctcttactg tcatgccatc cgtaagatgc 
tcattctgag aatagtgtat gcggcgaccg 
aataccgcgc cacatagcag aactttaaaa 
cgaaaactct caaggatctt accgctgttg 
cccaactgat cttcagcatc ttttactttc 



ctctaattct tctcccttct ctggtctctt 5160 
ctcatctttc cttcatatct gagatacccg 5220 
tgcccagcct ttgacctccg ggggctgcct 5280 
ctagaggagc cactaagttg gcgggaggta 5340 
ttgctcctgt tcctccttca gcctctgtcg 5400 
ctcatagtac tctttgtcta ggcgctctag 5460 
gtagtcagtc tcatcggcac aggggttgtc 5520 
gtcccgcagt ttggccagct cccccaggtc 5580 
gggaggggtt aacgcaggaa aatgggctct 5640 
taagaatacc tcctggaagc cactctgggc 5700 
fctggcatgat gcaattccct cattcctcct 5760 
atgtgttctt ttaaaacttt ggagcagcaa 5820 
agaagagggg caaatgtccc atacaggagc 5880 
agggggttgg aaggattggg cgagcagagg 5940 
aatgtggcat gtctgaaaaa gaggaggaat 6000 
gagatactgt gctgagttct tccctttgag 6060 
caaattccta aaaaagaaaa aagggtggac 6120 
gaaagaatgc ttatggacta gggactgttt 6180 
catgactcat agttaaagcg ctagcagctg 6240 
cttgctaatg acgtataagt tgttccattg 6300 
ttcgaggagt ctctttgttg aggacttttg 6360 
taaatatttg agattgaacc ctgtcgagta 6420 
cggaatccgg gccgagaact tcgcagtgac 6480 
agatttcgat tccaccgccg ccttctatga 6540 
cgccggctgg atgatcctcc agcgcgggga 6600 
cttgtttatt gcagcttata atggttacaa 6660 
taaagcatt.t ttttcactgc attctagttg 6720 
tcatgtctgg atcccgtcga cctcgagagc 6780 
gtgtgaaatt gttatccgct cacaattcca 6840 
aaagcctggg gtgcctaatg agtgagctaa 6900 
gctttccagt cgggaaacct gtcgtgccag 6960 
agaggcggtt tgcgtattgg gcgctcttcc 7020 
gtcgttcggc tgcggcgagc ggtatcagct 7080 
gaatcagggg ataacgcagg aaagaacatg 7140 
cgtaaaaagg ccgcgttgct ggcgtttttc 7200 
aaaaatcgac gctcaagtca gaggtggcga 7260 
tttccccctg gaagctccct cgtgcgctct 7320 
ctgtccgcct ttctcccttc gggaagcgtg 7380 
ctcagttcgg tgtaggtcgt tcgctccaag 7440 
cccgaccgct gcgccttatc cggtaactat 7500 
ttatcgccac tggcagcagc cactggtaac 7560 
gctacagagt tcttgaagtg gtggcctaac 7620 
atctgcgctc tgctgaagcc agttaccttc 7680 
aaacaaacca ccgctggtag cggtggtttt 7740 
aaaaaaggat ctcaagaaga tcctttgatc 7800 
gaaaactcac gttaagggat tttggtcatg 7860 
cttttaaatt aaaaatgaag ttttaaatca 7920 
gacagttacc aatgcttaat cagtgaggca 7980 
tccatagttg cctgactccc cgtcgtgtag 8040 
ggccccagtg ctgcaatgat accgcgagac 8100 
ataaaccagc cagccggaag ggccgagcgc 8160 
atccagtcta ttaattgttg ccgggaagct 8220 
cgcaacgttg ttgccattgc tacaggcatc 8280 
tcattcagct ccggttccca acgatcaagg 8340 
aaagcggtta gctccttcgg tcctccgatc 8400 
tcactcatgg ttatggcagc actgcataat 8460 
ttttctgtga ctggtgagta ctcaaccaag 8520 
agttgctctt gcccggcgtc aatacgggat 8580 
gtgctcatca ttggaaaacg ttcttcgggg 8640 
agatccagtt cgatgtaacc cactcgtgca 8700 
accagcgttt ctgggtgagc aaaaacagga 8760 
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aggcaaaatg ccgcaaaaaa gggaataagg 
ttcctttttc aatattattg aagcatttat 
tttgaatgta tttagaaaaa taaacaaata 
ccacctgacg tcgacggatc gggagatctc 
tctgctctga tgccgcatag ttaagccagt 
ctgagtagtg cgcgagcaaa atttaagcta 
tgaagaatct gcttagggtt aggcgttttg 
cgcgttgaca ttgattattg actagttatt 
atagcccata tatggagttc cgcgttacat 
cgcccaacga cccccgccca ttgacgtcaa 
tagggacttt ccattgacgt caatgggtgg 
tacatcaagt gtatcatatg ccaagtacgc 
ccgcctggca ttatgcccag tacatgacc.t 
acgtattagt catcgctatt accatggtg 



gcgacacgga aatgttgaat actcatactc 8820 
cagggttatt gtctcatgag cggatacata 8880 
ggggttccgc gcacatttcc ccgaaaagtg 8940 
ccgatcccct atggtcgact ctcagtacaa 9000 
atctgctccc tgcttgtgtg ttggaggtcg 9060 
caacaaggca aggcttgacc gacaattgca 9120 
cgctgcttcg cgatgtacgg gccagatata 9180 
aatagtaatc aattacgggg tcattagttc 9240 
aacttacggt aaatggcccg cctggctgac 9300 
taatgacgta tgttcccata gtaacgccaa 9360 
actatttacg gtaaactgcc cacttggcag 942 0 
cccctattga cgtcaatgac ggtaaatggc 9480 
tatgggactt tcctacttgg cagtacatct 9540 

9569 



<210> 15 
<211> 401 
<212> PRT 

<213> Artificial- Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 15 



Met 


Asp 


. Ser 


Gly Ala 


Val 


Pro 


Thr. Asn 


.Ala 


Ser 


Asn Cys 


Thr Asp 


Pro 


1 








5 








10 








15 




Phe 


Thr 


His 


Pro' Ser 


Ser 


Cys 


Ser Pro 


Ala 


Pro 


Ser Pro 


Ser 


Ser 


Trp 








20 








25 








30 






Val 


Asn 


Phe 


Ser 


His 


Leu 


Glu 


Gly Asn 


Leu 


Ser 


Asp Pro 


Cys 


Gly Pro 






35 










40 






45 








Asn 


Arg 
50 


Thr 


Glu 


Leu 


Gly 


Gly 
55 


Ser Asp 


Arg 


Leu 


Cys Pro 
60 


Ser 


Ala 


Gly 


Ser 


Pro 


Ser 


Met 


He 


Thr 


Ala 


He He 


He 


Met 


Ala Leu 


Tyr 


Ser 


He 


65 










70 








75 








80 


Val 


Cys 


Val 


Val 


Gly 


Leu 


Phe 


Gly Asn 


Phe 


Leu 


Val Met 


Tyr Val 


He 










85 








90 








95 




Val 


Arg 


Tyr 


Thr 
100 


Lys 


Met 


Lys 


Thr Ala 
105 


Thr 


Asn 


He Tyr 


He 
110 


Phe 


Asn 


Leu 


Ala 


Leu 
115 


Ala 


Asp 


Ala 


Leu 


Ala Thr 
120 


Ser 


Thr 


Leu Pro 
125 


Phe 


Gin 


Ser 


Val 


Asn 


Tyr 


Leu 


Met 


Gly 


Thr 


Trp Pro 


Phe 


Gly 


Thr He 


Leu Cys Lys 




130 










135 








140 








He 


Val 


He 


Ser 


He 


Asp 


Tyr 


Tyr Asn 


Met 


Phe 


Thr Ser 


He 


Phe 


Thr 


145 










150 








155 








160 


Leu 


Cys 


Thr 


Met 


Ser 
165 


Val 


Asp 


Arg Tyr 


He 
170 


Ala 


Val Cys 


His 


Pro 
175 


Val 


Lys 


Ala 


Leu 


Asp 
180 


Leu 


Arg 


Thr 


Pro Arg 
185 


Asn 


Ala 


Lys He 


He 
190 


Asn 


He 


Cys 


Asn 


Trp 
195 


He 


Leu 


Ser 


Ser 


Ala He 
200 


Gly 


Leu 


Pro Val 
205 


Met 


Phe 


Met 


Ala 


Thr 
210 


Thr 


Lys 


Tyr 


Arg 


Gin 
215 


Gly Ser 


He 


Asp 


Cys Thr 
220 


Leu 


Thr 


Phe 


Ser 


His 


Pro 


Thr 


Trp 


Tyr 


Trp 


Glu Asn 


Leu 


Leu 


Lys He 


Cys 


Val 


Phe 


225 










230 








235 








240 


He 


Phe 


Ala 


Phe 


He 


Met 


Pro 


He Leu 


lie 


He 


Thr Val 


Cys 


Tyr Gly 










245 








250 








255 
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Leu 


Met 


He Leu Arg 


Leu 






260 




Glu 


Lys 


Asp Arg Asn 


Leu 






275 




Val 


Ala 


Val Phe He 


Val 




290 






lie 


Lys 


Ala Leu lie 


Thr 


305 






310 


Trp 


His 


Pne Cys He 


TV "1 _ 

Ala 






325 




Val 


Leu 


Tyr Ala Phe 


Leu 






340 




Phe 


Cys 


He Pro Thr 


Ser 






355 




lie 


Arg 


Gin Asn Thr 


Arg 




370 






Arg 


Thr 


Asn His Gin 


Leu 


385 






390 


Pro 









Lys Ser Val 


Arg Met 


Leu 


265 






Arg Arg He 


inr Arg 


nct. 


280 






cys irp mr 


Pro He 


n± o 


295 




iUU 


lie pro Glu 


inr inr 


xrlie 




Jib 




Leu Gly Tyr 


Thr Asn 


Ser 




330 




Asp Glu Asn 


Phe Lys 


Arg 


345 






Ser Thr He 


Glu Gin 


Gin 


360 






Asp His Pro 


Ser Thr 


Ala 


375 




380 


Glu Asn Leu 


Glu Ala 


Glu 




395 
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Ser 


urxy 


OCI 


Lys 




4* t \J 






v ax 


lieu 


val 


Val 

V CtX 


*y ft r 








He 




Val 


He 




Thr 


Val 


Ser 










Cys 


Leu 


Asn 


Pro 






335 




cys 


pne 


Arg 


V?1U 




350 






Asn 


Ser 


Thr 


Arg 


365 








Asn 


Thr 


Val 


Asp 


Thr 


Thr 


Pro 


Leu 



400 



<210> 16 
<211> 1415 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 16 

gcctgacgct cctctctggc tccgccgggg ttggtcgctg taagaaataa caggagctgt 60 
ggcagcggcg aagacgaagc ggctcgcgcg tggaacccga aaagtcaggg tgctcgcggt 120 
tactcccaac gtggtcccag ccggcggtca gcaccatgga cagcggcgcc gtccccacga 180 
acgccagcaa ctgcactgat cccttcacac acccttcaag ttgctcccca gcacctagtc 240 
ccagctcctg ggtcaacttc tcccacttag aaggcaacct gtccgaccca tgcggtccga 300 
accgcaccga gctgggaggg agcgacagac tgtgcccttc ggccggcagc ccttccatga 360 
tcacggccat catcatcatg gccctctact ccatcgtgtg cgtggtgggg ctcttcggaa 420 
acttcctggt catgtatgtg attgtcaggt acaccaaaat gaagactgcc accaacatct 480 
atattttcaa cctcgccctg gcagatgccc tggcaaccag taccctgcct ttccagagtg 540 
tcaattacct gatgggaaca tggccgtttg gaaccatcct gtgcaagatt gtgatctcca 600 
tagattacta caatatgttc accagcatat tcaccctctg caccatgagt gtggatcgct 660 
acattgcagt ctgccatcct gtcaaggccc tggatttacg cactccccgt aatgccaaga 720 
tcatcaacat ctgcaactgg atcctctctt cagccattgg tctgcctgtg atgttcatgg 780 
caacgacaaa gtaccggcaa ggttccatag attgtacact aacattctct cacccaacgt 840 
ggtactggga aaacctgctg aaaatctgtg ttttcatctt tgccttcatc atgcctatcc 900 
tcatcattac agtgtgttat gggctgatga tcttacgcct caagagtgtc cgcatgctct 960 
ctggctccaa agaaaaggac aggaacctgc gaagaatcac caggatggtg ctggtggttg 1020 
tggctgtgtt cattgtctgc tggacgccca ttcacatcta cgtcatcatt aaagccttga 1080 
tcacaatccc ggaaactact ttccagaccg tttcctggca cttctgcatt gctctaggtt 1140 
ataccaacag ttgcctcaac cccgtccttt atgcatttct ggatgaaaac ttcaaacgat 1200 
gcttcagaga gttctgtatc ccaacttcct ccaccattga gcagcaaaac tccactcgaa 1260 
ttcgtcagaa caccagagac cacccctcca cagccaatac ggtggatagg actaaccatc 1320 
agctagaaaa tctggaagca gaaaccactc cgttacccta actgggtctc ataccattca 1380 
gaccctcact gagcttagac gccacatcta tatga 1415 

<210> 17 
<211> 398 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial 
synthetic construct 



<400> 17 








Met Asp Ser 


Ser Ala Gly 


Pro 


Gly 


1 


5 






Leu Ala Pro 


Ala Ser Cys 


Ser 


Pro 




20 






Ser His Val 


Asp Gly Asn 


Gin 


Ser 


35 






40 


Gly Leu Gly 


Gly Ser His 


Ser 


Leu 


50 




55 




Met Val Thr 


Ala He Thr 


He 


Met 


65 


70 






Val Gly Leu 


Phe Gly Asn 


Phe 


Leu 




85 






Thr Lys Met 


Lys Thr Ala 


Thr 


Asn 




100 






Ala Asp Ala 


Leu Ala Thr 


Ser 


Thr 


115 






120 


Leu Met Gly 


Thr Trp Pro 


Phe 


Gly 


130 




135 




Ser He Asp 


Tyr Tyr Asn 


Met 


Phe 


145 


150 






Met Ser Val 


Asp Arg Tyr 


He 


Ala 




165 






Asp Phe Arg 


Thr Pro Arg 


Asn 


Ala 


■ 


180 






He Leu Ser 


Ser Ala He 


Gly 


Leu 


195 






200 


Lys Tyr Arg 


Gin Gly Ser 


He 


Asp 


210 




215 




Thr Trp Tyr 


Trp Glu Asn 


Leu 


Leu 


225 


230 






Phe He Met 


Pro Val Leu 


He 


He 




245 






Leu Arg Leu 


Lys Ser Val 


Arg 


Met 




260 






Arg Asn Leu 


Arg Arg He 


Thr 


Arg 


275 






280 


Phe He Val 


Cys Trp Thr 


Pro 


He 


290 




295 




Leu He Thr 


He Pro Glu 


Thr 


Thr 


305 


310 






Cys He Ala 


Leu Gly Tyr 


Thr 


Asn 




325 






Ala Phe Leu 


Asp Glu Asn 


Phe 


Lys 




340 






Pro Thr Ser 


Ser Thr He 


Glu 


Gin 


355 






360 


Asn Thr Arg 


Glu His Pro 


Ser 


Thr 


370 




375 




His Gin Leu 


Glu Asn Leu 


Glu 


Ala 


385 


390 







Sequence : /note = 



Asn 


He 


Ser 


Asp Cys 


Ser Asp 


Pro 




10 






15 




Ala 


Pro 


Gly 


Ser Trp 


Leu Asn 


Leu 


25 








30 




Asp 


Pro 


Cys 


Gly Pro Asn Arg 


Thr 








45 






Cys 


Pro 


Gin 


Thr Gly Ser Pro 


Ser 








60 






Ala 


Leu 


Tyr 


Ser He 


Val Cys 


Val 






75 






80 


Val 


Met 


Tyr 


Val He 


Val Arg 


Tyr 




90 






95 




He 


Tyr 


He 


Phe Asn 


Leu Ala 


Leu 


105 








110 




Leu 


Pro 


Phe 


Gin Ser 


Val Asn 


Tyr 








125 






Asn 


He 


Leu 


Cys Lys 


He Val 


He 








140 






Thr 


Ser 


He 


Phe Thr Leu Cys 


Thr 






155 






160 


Val 


Cys 


His 


Pro Val 


Lys Ala 


Leu 




170 






175 




Lys 


He 


Val 


Asn Val 


Cys Asn 


Trp 


185 








190 




Pro 


Val 


Met 


Phe Met 


Ala Thr 


Thr 








205 






Cys 


Thr 


Leu 


Thr Phe 


Ser His 


Pro 








220 






Lys 


He 


Cys 


Val Phe 


He Phe 


Ala 






235 






240 


Thr 


Val 


Cys 


Tyr Gly 


Leu Met 


He 




250 






255 




Leu 


Ser 


Gly 


Ser Lys 


Glu Lys 


Asp 


265 








270 




Met 


Val 


Leu 


Val Val 


Val Ala 


Val 








285 






His 


He 


Tyr 


Val He 


He Lys 


Ala 








300 






Phe 


Gin 


Thr 


Val Ser 


Trp His 


Phe 






315 






320 


Ser 


Cys 


Leu 


Asn Pro 


Val Leu 


Tyr 




330 






335 




Arg 


Cys 


Phe 


Arg Glu 


Phe Cys 


He 


345 








350 




Gin 


Asn 


Ser 


Ala Arg 


He Arg 


Gin 








365 






Ala 


Asn 


Thr 


Val Asp 


Arg Thr 


Asn 








380 






Glu 


Thr 


Ala 


Pro Leu 


Pro 





395 



<210> 18 
<211> 2229 
<212> DNA 



25/53 



WO 2004/073646 



PCTAJS2004/004914 



<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 18 

cggatcctta gcatccccaa agcgcctccg tgtacttcta aggtgggagg gggatacaag 60 
cagaggagaa tatcggacgc tcagacgttc cattctgcct gccgctcttc tctggttcca 120 
ctagggcttg tccttgtaag aaactgacgg agcctagggc agctgtgaga ggaagaggct 180 
ggggcgcctg gaacccgaac actcttgagt gctctcagtt acagcctacc gagtccgcag 240 
caagcattca gaaccatgga cagcagcgcc ggcccaggga acatcagcga ctgctctgac 300 
cccttagctc ctgcaagttg gtccccagca cctggctcct ggctcaactt gtcccacgtt 360 
gatggcaacc agtccgaccc atgcggtcct aaccgcacgg ggcttggcgg gagccacagc 420 
ctgtgccctc agaccggcag cccttccatg gtcacagcca tcaccatcat ggccctctat 480 
tctatcgtgt gtgtagtggg cctctttgga aacttcctgg tcatgtatgt gattgtaaga 540 
tataccaaaa tgaagactgc caccaacatc tacattttca accttgctct ggcagatgcc 600 
ttagccacta gcacgctgcc ctttcagagt gttaactacc tgatgggaac gtggcccttt 660 
ggaaacatcc tctgcaagat cgtgatctca atagactact acaacatgtt caccagtatc 720 
ttcaccctct gcaccatgag tgtagaccgc tacattgccg tctgccaccc ggtcaaggcc 780 
ctggatttcc gtaccccccg aaatgccaaa attgtcaatg tctgcaactg gatcctctct 840 
tctgccattg gtctgcccgt aatgttcatg gcaaccacaa aatacaggca ggggtccata 900 
gattgcaccc tcactttctc tcatcccaca tggtactggg agaacctgct caaaatctgt 960 
gtcttcatct tcgccttcat catgccggtc ctcatcatca ctgtgtgtta tggactgatg 1020 
atcttacgac tcaagagtgt ccgcatgctg tcgggctcca aagaaaagga caggaacctg 1080 
cgcaggatca cccggatggt gctggtggtc gtggctgtat ttattgtctg ctggaccccc 1140 
atccacatct atgtcatcat caaagcactg atcacgattc cagaaaccac tttccagact - 1200 
gtttcctggc acttctgcat tgccttgggt tacacaaaca gctgcctgaa cccagttctt 1260 
tatgcgttcc tggatgaaaa cttcaaacga tgttttagag agttctgcat cccaacttcc 1320 
tccacaatcg aacagcaaaa ctctgctcga atccgtcaaa acactaggga acacccctcc 1380 
acggctaata cagtggatcg aactaaccac cagctagaaa atctggaagc agaaactgct 1440 
ccattgccct aactgggtcc cacgccatcc agaccctcgc taaacttaga ggctgccatc 1500 
tacttggaat caggttgctg tcagggtttg tgggaggctc tggtttcctg gaaaagcatc 1560 
tgatcctgca tcattcaaag tcattcctct ctggctattc acgctacacg tcagagacac 1620 
tcagactgtg tcaagcactc agaaggaaga gactgcaggc cactactgaa tccagctcat 1680 
gtacagaaac atccaatgga ccacaatact ctgtggtatg tgatttgtga tcaacataga 1740 
aggtgaccct tccctatgtg gaatttttaa tttcaaggaa atacttatga tctcatcaag 1800 
ggaaaaatag atgtcacttg ttaaattcac tgtagtgatg cataaaggaa aagctacctc 1860 
tgacctctag cccagtcacc ctctatggaa agttccatag ggaatatgtg agggaaaatg 1920 
ttgcttccaa attaaatttt cacctttatg ttatagtcta gttaagacat caggggcatc 1980 
tctgtttctt ggttttgtat tgtttgaaag aagacatctt cctccctagc tgcgtgttga 2040 
aaatgaaagg gatttaaaac acagtgtcaa ctgcagaata gttgattctc gcactgaagg 2100 
gggggggcta atcttcccaa ttctttccat gtcctccaag tgttcacaag gtcaaactca 2160 
gagtcaccca gtaagctcat catgccacqa ttctgagcaa aatccttgga ttcctgctca 2220 
gaatggtgg 2229 

<210> 19 
<211> 398 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 19 

Met Asp Ser Ser Thr Gly Pro Gly Asn Thr Ser Asp Cys Ser Asp Pro 

1 5 10 15 

Leu Ala Gin Ala Ser- Cys Ser Pro Ala Pro Gly Ser Trp Leu Asn Leu 
20 25 30 
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Ser 


His 


Val 


Asp Gly Asn 


Gin Ser 


Asp Pro Cys 


Gly 


Leu 


Asn 


Arg 


Thr 






35 




40 








45 








Gly 


Leu 


Gly 


Gly Asn Asp 


Ser Leu 


Cys Pro Gin 


Thr 


Gly 


Ser 


Pro 


Ser 




50 






55 






60 










Met 


Val 


Thr 


Ala He Thr 


He- Met 


Ala Leu Tyr 


Ser 


He 


Val 


Cys 


Val 


65 






70 






75 










80 


Val 


Gly 


Leu 


Phe Gly Asn 


Phe Leu 


Val Met 


Tyr 


Val 


He 


Val 


Arg 


Tyr 








85 




90 










95 




Thr 


Lys 


Met 


Lys Thr Ala 


Thr Asn 


He Tyr 


He 


Phe 


Asn 


Leu 


Ala 


Leu 








100 




105 








110 






Ala 


Asp 


Ala 


Leu Ala Thr 


Ser Thr 


Leu Pro 


Phe 


Gin 


Ser 


Val 


Asn 


Tyr 






115 




120 








125 








Leu 


Met 


Gly 


Thr Trp Pro 


Phe Gly 


Thr He 


Leu 


Cys 


Lys 


He 


Val 


He 




13 0 






135 






140 










Ser 


He 


Asp 


Tyr Tyr Asn 


Met Phe 


Thr Ser 


He 


Phe 


Thr 


Leu 


Cys 


Thr 


145 






150 






155 










160 


Met 


Ser 


Val 


Asp Arg Tyr 


lie Ala 


Val Cys His 


Pro 


Val 


Lys 


Ala 


Leu 








165 




170 










175 




Asp 


Phe 


Arg 


Thr Pro Arg 


Asn Ala 


Lys He 


Val 


Asn 


Val 


Cys 


Asn 


Trp 








180 . 




185 








190 






lie 


Leu 


Ser 


Ser Ala He 


Gly Leu 


Pro Val 


Met 


Phe 


Met 


Ala 


Thr 


Thr 






195 




200 








205 








Lys 


Tyr 


Arg 


Gin Gly Ser 


He Asp 


Cys Thr 


Leu 


Thr 


Phe 


Ser 


His 


Pro 




210 






215 






220 










Thr 


Trp 


Tyr 


Trp Glu Asn 


Leu Leu 


Lys He 


Cys 


Val 


Gly 


He 


Phe 


Ala 


225 






230 






235 










240 


Phe 


He 


Met 


Pro Val Leu 


lie- He 


Thr Val 


Cys 


Tyr 


Gly 


Leu 


Met 


He 








245 




250 










255 




Leu 


Arg 


Leu 


Lys Ser Val 


Arg Met 


Leu Ser Gly 


Ser 


Lys 


Glu 


Lys 


Asp 








260 




265 








270 






Arg 


Asn 


Leu 


Arg Arg lie 


Thr Arg 


Met Val 


Leu 


Val 


Val 


Val 


Ala 


Val 






275 




280 








285 








Phe 


He 


Val 


Cys Trp Thr 


Pro He 


His He 


Tyr 


Val 


He 


He 


Lys 


Ala 




290 






295 






300 










Leu 


He 


Thr 


He Pro Glu 


Thr Thr 


Phe Gin 


Thr 


Val 


Ser 


Trp 


His 


Phe 


305 






310 






315 










320 


Cys 


He 


Ala 


Leu Gly Tyr 


Thr Asn 


Ser Cys 


Leu 


Asn 


Pro 


Val 


Leu 


Tyr 








325 




330 










335 




Ala 


Phe 


Leu 


Asp Glu Asn 


Phe Lys 


Arg Cys 


Phe 


Arg 


Glu 


Phe 


Cys 


He 








340 




345 








350 






Pro 


Thr 


Ser 


Ser Thr He 


Glu Gin 


Gin Asn 


Ser 


Thr 


Arg 


Val 


Arg 


Gin 






355 




360 








365 








As n 


Thr 


Arg 


Glu His Pro 


Ser Thr 


Ala Asn 


Thr 


Val 


Asp 


Arg 


Thr 


Asn 




370 






375 






380 










His 


Gin 


Leu 


Glu Asn Leu 


Glu Ala 


Glu Thr 


Ala 


Pro 


Leu 


Pro 






385 






390 






395 













<210> 20 
<211> 1401 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 20 

cacgagcttc tgcctgccgc tcttctctgg ttccactagg gctggtccat gtaagaatct 60 
gacggagcct agggcagctg tgagaggaag aggctggggc gcgtggaacc cgaaaagtct 120 
gagtgctctc agttacagcc tacctagtcc gcagcaggcc ttcagcacca tggacagcag 180 
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caccggccca gggaacacca gcgactgctc agacccctta gctcaggcaa gttgctcccc 240 
agcacctggc tcctggctca acttgtccca cgttgatggc aaccagtccg atccatgcgg 300 
tctgaaccgc accgggcttg gcgggaacga cagcctgtgc cctcagaccg gcagcccttc 360 
catggtcaca gccattacca tcatggccct ctactctatc gtgtgtgtag tgggcctctt 420 
cggaaacttc ctggtcatgt atgtgattgt aagatacacc aaaatgaaga ctgccaccaa 480 
catctacatt ttcaaccttg ctctggcaga cgccttagcg accagtacac tgccctttca 540 
gagtgtcaac tacctgatgg gaacatggcc cttcggaacc atcctctgca agatcgtgat 600 
ctcaatagat tactacaaca tgttcaccag catattcacc ctctgcacca tgagcgtgga 660 
ccgctacatt gctgtctgcc acccagtcaa agccctggat ttccgtaccc cccgaaatgc 720 
caaaatcgtc aacgtctgca actggatcct ctcttctgcc atcggtctgc ctgtaatgtt 780 
catggcaacc acaaaataca ggcaggggtc catagattgc accctcacgt tctcccaccc 840 
aacctggtac tgggagaacc tgctcaaaat ctgtgtcttt atcttcgctt tcatcatgcc 900 
ggtcctcatc atcactgtgt gttacggcct gatgatctta cgactcaaga gcgttcgcat 960 
gctatcgggc tccaaagaaa aggacaggaa tctgcgcagg atcacccgga tggtgctggt 1020 
ggtcgtggct gtatttatcg tctgctggac ccccatccac atctacgtca tcatcaaagc 1080 
gctgatcacg attccagaaa ccacatttca gacggtttcc tggcacttct gcattgcttt 1140 
gggttacacg aacagctgcc tgaatccagt tctttacgcc ttcctggatg aaaacttcaa 1200 
gcgatgcttc agagagttct gcatcccaac ctcgtccacg atcgaacagc aaaactccac 1260 
tcgagtccgt cagaacacta gggaacatcc ctccacggct aatacagtgg atcgaactaa 1320 
ccaccagcta gaaaatctgg aggcagaaac tgctccattg ccctaactgg gtctcacacc 1380 
atccagaccc tcgctaagct t i 1401 

<210> 21 
<211> 401 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequencer/note = 
synthetic construct 



<400> 21 



Met Asp Ser Ser Ala 


Asp Pro 


Arg Asn 


Ala 


Ser Asn Cys 


Thr Asp 


Pro 


1 5 






10 








15 




Phe Ser Pro Ser Ser 


Met Cys 


Ser Pro 


Val 


Pro Ser 


Pro 


Ser 


Ser 


Trp 


20 




25 








30 






Val Asn Phe Ser His 


Leu Glu 


Gly Asn 


Leu 


Ser Asp 


Pro 


Cys 


He Arg 


35 




40 






45 








Asn Arg Thr Glu Leu 


Gly Gly 


Ser Asp 


Ser 


Leu Cys 


Pro 


Pro Thr Gly 


50 


55 ■ 






60 










Ser Pro Ser Met Val 


Thr Ala 


He Thr 


He 


Met Ala 


Leu 


Tyr Ser 


He 


65 


70 






75 








80 


Val Cys Val Val Gly 


Leu Phe 


Gly Asn 


Phe 


Leu Val 


Met 


Tyr Val 


He 


85 






90 








95 




Val Arg Tyr Thr Lys 


Met Lys 


Thr Ala 


Thr 


Asn He 


Tyr 


He 


Phe 


Asn 


100 




105 








110 






Leu Ala Leu Ala Asp 


Ala Leu 


Ala Thr 


Ser 


Thr Leu 


Pro 


Phe 


Gin 


Ser 


115 




120 






125 








Val Asn Tyr Leu Met 


Gly Thr 


Trp Pro 


Phe 


Gly Thr He 


Leu 


Cys 


Lys 


130 


135 






140 










lie Val lie Ser He 


Asp Tyr 


Tyr Asn 


Met 


Phe Thr 


Ser 


He 


Phe 


Thr 


145 


150 






155 








160 


Leu Cys Thr Met Ser 


Val Asp 


Arg Tyr 


He 


Ala Val 


Cys 


His 


Pro 


Val 


165 






170 








175 




Lys Ala Leu Asp Phe 


Arg Thr 


Pro Arg 


Asn 


Ala Lys 


He 


lie 


Asn 


Val 


180 




185 








190 






Cys Asn Trp He Leu 


Ser Ser 


Ala He 


Gly 


Leu Pro 


Val 


Met 


Phe 


Met 


195 




200 






205 








Ala Thr Thr Lys Tyr 


Arg Asn 


Gly Ser 


lie 


Asp Cys Ala 


Leu 


Thr 


Phe 


210 


215 






220 
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Ser 


His 


Pro 


Thr Trp Tyr 


Trp 


Glu Asn Leu Leu 


Lys 


He Cys Val 


Phe 


225 






230 




235 






240 


lie 


Phe 


Ala 


Phe He Met 


Pro 


Val Leu He He 


Thr 


Val Cys Tyr 


Gly 








245 




250 




255 




Leu 


Met 


He 


Leu Arg Leu 


Lys 


Ser Val Arg Met 


Leu 


Ser Gly Ser 


Lys 








260 




265 




270 




Glu 


Lys 


Asp 


Arg Asn Leu 


Arg 


Arg He Thr Arg 


Met 


Val Leu Val 


Val 






275 






280 




285 




Val 


Ala 


Val 


Phe He Val 


Cys 


Trp Thr Pro He 


His 


He Tyr Val 


He 




290 






295 




300 






lie 


Lys 


Ala 


Leu He Thr 


He 


Pro Glu Thr Thr 


Phe 


Gin Thr Val 


Ser 


305 






310 




315 






320 


Trp 


His 


Phe 


Cys He Ala 


Leu 


Gly Tyr Thr Asn 


Ser 


Cys Leu Asn 


Pro 








325 




330 




335 




Val 


Leu 


Tyr 


Ala Phe Leu 


Asp 


Glu Asn Phe Lys 


Arg 


Cys Phe Arg 


Glu 








340 




345 




350 




Phe 


Cys 


He 


Pro Thr Ser 


Ser- 


Thr He Glu Gin 


Gin 


Asn Ser Ala 


Arg 






355 






360 




365 




lie 


Arg 


Gin 


Asn Thr Arg 


Asp 


His Pro Ser Thr 


Ala 


Asn Thr Val 


Asp 




370 






375 




380 






Arg 


Thr 


Asn 


His Gin Leu 


Glu 


Asn Leu Glu Ala 


Glu 


Thr Ala Pro 


Leu 


385 






390 




395 






400 


Pro 



















*<210> 22 ..... 
<211> 1881 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note » 
synthetic construct 

<400> 22 

atgagctgtg- gtgtacttct aagatgggag ggggcaacaa gcagaagata atgtcagaag 60 
cttagctctc cttctgcctg acgctcctct ctggctccgc ctgggttggc ttctgtaaga 120 
agtagcagga gccgtggcgg gggctggagg aagcggctga ggcgcgtgga acccgaaaag 180 
cccgggtgat cgcggttacc tcactgcgtg gtcccagccg cccagccgtc agcaccatgg 240 
acagcagcgc tgacccccga aacgccagca attgcactga tcccttctcg ccctcttcaa 300 
tgtgctcccc agtacctagc cccagctcct gggtcaactt ctcccactta gaaggcaacc 360 
tgtccgaccc atgcattcgg aaccgcaccg agctgggcgg gagcgacagc ctgtgccctc 420 
cgaccggcag tccttccatg gtcacggcca tcaccatcat ggccctctac tccatcgtgt 480 
gcgtggtggg tctcttcgga aacttcctgg tcatgtatgt gattgtcaga tacaccaaaa 540 
tgaagactgc caccaacatc tatattttca accttgctct ggcggatgcc ttagccacca 600 
gtaccctacc cttccagagt gtcaattacc taatgggaac gtggccgttt ggaaccatcc 660 
tctgcaagat cgtgatctcc atagattact acaatatgtt caccagcata ttcaccctct 720 
gtaccatgag cgtggatcgc tacatcgccg tctgccatcc cgtcaaggcc ctggacttcc 780 
gcactccccg caacgccaaa atcatcaacg tctgcaactg gatcctctct tcagccattg 840 
gtctgcctgt gatgttcatg gcaacaacaa agtaccggaa tggttccata gattgtgcac 900 
taacattctc tcacccaacc tggtactggg aaaacctgct gaaaatctgt gttttcatct 960 
ttgccttcat catgcctgtc ctcatcatta cggtgtgtta tgggctgatg atcttacgcc 1020 
tcaagagtgt tcgcatgctc tctggctcca aagaaaagga taggaacctg cgaagaatca 1080 
ccaggatggt gctggtggtt gtggctgtgt tcattgtctg ctggactccc attcacattt 1140 
acgtcatcat taaagccttg attacaattc cagaaactac tttccagact gtgtcctggc 1200 
acttctgcat tgctctaggt tatacaaaca gctgcctgaa cccagtcctt tatgcatttc 1260 
tggatgaaaa cttcaaacga tgcttcagag agttctgtat cccaacctcc tccaccattg 1320 
agcagcaaaa ctccgctcga atccgtcaaa acaccagaga ccacccctcc acggccaaca 1380 
cggtggacag gaccaaccat cagctagaaa atctggaagc agaaactgct ccattgccct 1440 
aaccaggtgt catgccattc agatcctcaa tgagctaaga cagccaccat ctacgtggaa 1500 
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gcaggttgcc atgagaatgt gtgggaggca ctattttcct aggaaagtgc ctgctctgag 1560 
tcatcaaatc tgtttcctct ctggccgctc tgctctgcac atgagaggga catccaaact 1620 
aaatcaagca ctaggaagga aagaactaat ccacatggag tttgcctgtg cacataatct 1680 
caaggaagat gacccatggg accgaaacat gctgtggtat gtgcgttgag gtcatcctca 1740 
aagatggccc ttctgtatgt aatgtgctgt tttcaagcaa atgtttacgt cctcatcaaa 1800 
gaaaaaatgt cagttgttaa attcaccata gtaacttgta aaggctacct ctgatcgaag 1860 
catcttatgt ggaaatccaa g 1881 

<210> 23 
<211> 372 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note - 
synthetic construct 



<400> 23 



Met 


Glu 


Pro 


Ala 


Pro 


Ser 


Ala 


Gly 


Ala 


Glu 


Leu 


Gin Pro Pro Leu Phe 


1 








5 










10 




15 


Ala- 


Asn Ala 


Ser 


Asp 


Ala 


Tyr 


Pro 


Ser 


Ala 


Phe 


Pro Ser Ala Gly Ala 








20 










25 






30 


Asn 


Ala 


Ser 


Gly 


Pro 


Pro 


Gly 


Ala 


Arg 


Ser 


Ala 


Ser Ser Leu Ala Leu 






35 










40 








45 


Ala 


He 


Ala 


He 


Thr 


Ala 


Leu 


Tyr 


Ser 


Ala 


Val 


Cys Ala Val Gly Leu 




50 










55 










60 


Leu 


Gly- Asn 


Val 


Leu 


Val 


Met 


Phe 


Gly 


He 


Val 


Arg . Tyr Thr Lys Met 


65 










70 










75 


80 


Lys 


Thr 


Ala 


Thr 


Asn 


He 


Tyr 


He 


Phe 


Asn 


Leu 


Ala Leu Ala Asp Ala 










85 










90 




95 


Leu 


Ala 


Thr 


Ser 


Thr 


Leu 


Pro 


Phe 


Gin 


Ser 


Ala 


Lys Tyr Leu Met Glu 








100 










105 






110 


Thr 


Trp 


Pro 


Phe 


Gly 


Glu 


Leu 


Leu 


Cys 


Lys 


Ala 


Val Leu Ser He Asp 






115 










120 








125 


Tyr 


Tyr 


Asn 


Met 


Phe 


Thr 


Ser 


He 


Phe 


Thr 


Leu 


Thr Met Met Ser Val 




130 










135 










140 


Asp 


Arg 


Tyr 


He 


Ala 


Val 


Cys 


His 


Pro 


Val 


Lys 


Ala Leu Asp Phe Arg 


145 










150 










155 


160 


Thr 


Pro 


Ala 


Lys 


Ala 


L ys 


Leu 


He 


Asn 


He 


Cys 


He Trp Val Leu Ala 










165 










170 




175 


Ser 


Gly 


Val 


Gly 


Val 


Pro 


He 


Met 


Val 


Met 


Ala 


Val Thr Arg Pro Arg 








180 










185 






190 


Asp 


Gly 


Ala 


Val 


Val 


Cys 


Met 


Leu 


Gin 


Phe 


Pro 


Ser Pro Ser Trp Tyr 






195 










200 








205 


Trp 


Asp 


Thr 


Val 


Thr 


Lys 


lie 


Cys 


Val 


Phe 


Leu 


Phe Ala Phe Val Val 




210 










215 










220 


Pro 


He 


Leu 


He 


He 


Thr 


Val 


Cys 


Tyr 


Gly 


Leu 


Met Leu Leu Arg Leu 


225 










230 










235 


240 


Arg 


Ser 


Val 


Arg 


Leu 


Leu 


Ser 


Gly 


Ser 


Lys 


Glu 


Lys Asp Arg Ser Leu 










245 










250 




255 


Arg 


Arg 


He 


Thr 


Arg 


Met 


Val 


Leu 


Val 


Val 


Val 


Gly Ala Phe Val Val 








260 










265 






270 


Cys 


Trp 


Ala 


Pro 


He 


His 


He 


Phe 


Val 


He 


Val 


Trp Thr Leu Val Asp 






275 










280 








285 


He 


Asp 


Arg 


Arg 


Asp 


Pro 


Leu 


Val 


Val 


Ala 


Ala 


Leu His Leu Cys He 




290 










295 










300 


Ala 


Leu 


Gly 


Tyr 


Ala 


Asn 


Ser 


Ser 


Leu 


Asn 


Pro 


Val Leu Tyr Ala Phe 


305 










310 










315 


320 


Leu 


Asp 


Glu 


Asn 


Phe 


Lys 


Arg 


Cys 


Phe 


Arg 


Gin 


Leu Cys Arg Lys Pro 



325 330 335 
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Cys Gly Arg Pro Asp Pro Ser Ser Phe Ser Arg Ala Arg Glu Ala Thr 

340 345 350 

Ala Arg Glu Arg Val Thr Ala Cys Thr Pro Ser Asp Gly Pro Gly Gly 

355 360 365 

Gly Ala Ala Ala 
370 



<210> 24 
<211> 1773 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note » 
synthetic construct 

<400> 24 

ccgaggagcc tgcgctgctc ctggctcaca gcgctccggg cgaggagagc gggcggaccg 60 
gggggctggg ccggtgcggg cggcgaggca ggcggacgag gcgcagagac agcggggcgg 120 
ccggggcgcg gcacgcggcg ggtcggggcc ggcctctgcc ttgccgctcc cctcgcgtcg 180 
gatccccgcg cccaggcagc cggtggagag ggacgcggcg gacgccggca gccatggaac 240 
cggccccctc cgccggcgcc gagctgcagc ccccgctctt cgccaacgcc tcggacgcct 300 
accctagcgc cttccccagc gctggcgcca atgcgtcggg gccgccaggc gcgcggagcg 360 
cctcgtccct cgccctggca atcgccatca ccgcgctcta ctcggccgtg tgcgccgtgg 420 
ggctgctggg caacgtgctt gtcatgttcg gcatcgtccg gtacactaag atgaagacgg 480 
ccaccaacat ctacatcttc aacctggcct tagccgatgc gctggccacc agcacgctgc 540 
ctttccagag tgccaagtac ctgatggaga cgtggccctt cggcgagctg ctctgcaagg 600 
ctgtgctctc catcgactac tacaatatgt tcaccagcat cttcacgctc accatgatga 660 
gtgttgaccg ctacatcgct gtctgccacc ctgtcaaggc cctggacttc cgcacgcctg 720 
ccaaggccaa gctgatcaac atctgtatct gggtcctggc ctcaggcgtt ggcgtgccca 780 
tcatggtcat ggctgtgacc cgtccccggg acggggcagt ggtgtgcatg ctccagttcc 840 
ccagccccag ctggtactgg gacacggtga ccaagatctg cgtgttcctc ttcgccttcg 900 
tggtgcccat cctcatcatc accgtgtgct atggcctcat gctgctgcgc ctgcgcagtg 960 
tgcgcctgct gtcgggctcc aaggagaagg accgcagcct gcggcgcatc acgcgcatgg 1020 
tgctggtggt tgtgggcgcc ttcgtggtgt gttgggcgcc catccacatc ttcgtcatcg 108 0 
tctggacgct ggtggacatc gaccggcgcg acccgctggt ggtggctgcg ctgcacctgt 1140 
gcatcgcgct gggctacgcc aatagcagcc tcaaccccgt gctctacgct ttcctcgacg 1200 
agaacttcaa gcgctgcttc cgccagctct gccgcaagcc ctgcggccgc ccagacccca 1260 
gcagcttcag ccgcgcccgc gaagccacgg cccgcgagcg tgtcaccgcc tgcaccccgt 1320 
ccgatggtcc cggcggtggc gctgccgcct gaccaggcca tccggccccc agacgcccct 1380 
ccctagttgt acccggaggc cacatgagtc ccagtgggag gcgcgagcca tgatgtggag 1440 
tggggccagt agataggtcg gagggctttg ggaccgccag atggggcctc tgtttcggag 1500 
acgggaccgg gccgctagat gggcatgggg tgggcctctg gtttggggcg aggcagagga 1560 
cagatcaatg gcgcagtgcc tctggtctgg gtgcccccgt ccacggctct aggtggggcg 162 0 
ggaaagccag tgactccagg agaggagcgg gacctgtggc tctacaactg agtccttaaa 1680 
cagggcatct ccaggaaggc ggggcttcaa ccttgagaca gcttcggttt ctaacttgga 1740 
gccggacttt cggagttggg gggtccgggg ccc 1773 

<210> 25 
<211> 228 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 25 

Gly lie Val Arg Tyr Thr Lys Met Lys Thr Ala Thr Asn He Tyr He 
1 5 10 15 
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Phe 


Asn 


Leu 


Ala 


Leu 


Ala 


Asp Ala 


Leu Ala 


Thr Ser- Thr 


Leu 


Pro 


Phe 








20 








25 




30 






Gin 


Ser 


Ala 


Lys 


Tyr 


Leu 


Met Glu 


Thr Trp 


Pro Phe Gly 


Glu 


Leu 


Leu 






35 








40 




45 








Cys 


Lys 


Ala 


Val 


Leu 


Ser 


He Asp 


Tyr Tyr 


Asn Met Phe 


Thr 


Ser 


He 




50 










55 




60 








Phe 


Thr 


Leu 


Thr 


Met 


Met 


Ser Val 


Asp Arg 


Tyr He Ala 


Val 


Cys 


His 


65 










70 






75 






80 


Pro 


Val 


Lys 


Ala 


Leu 


Asp 


Phe Arg 


Thr Pro 


Ala Lys Ala 


Lys 


Leu 


He 










85 






90 






95 




Asn 


lie 


Cys 


He 


Trp 


Val 


Leu Ala 


Ser Gly 


Val Gly Val 


Pro 


He 


Met 








100 








105 




110 






Val 


Met 


Ala 


Val 


Thr 


Arg 


Pro Arg 


Asp Gly 


Ala Val Val 


Cys 


Met 


Leu 






115 








120 




125 








Gin 


Phe 


Pro 


Ser 


Pro 


Ser 


Trp Tyr 


Trp Asp 


Thr Val Thr 


Lys 


lie 


Cys 




130 










135 




140 








Val 


Phe 


Leu 


Phe 


Ala 


Phe 


Val Val 


Pro He 


Leu Val He 


Thr 


Val 


Cys 


145 










150 






155 






160 


Tyr 


Gly 


Leu 


Met 


Leu 


Leu 


Arg Leu 


Arg Ser 


Val Arg Leu 


Leu Ser Gly 










165 






170 






175 




Ser 


Lys 


Glu 


Lys 


Asp 


Arg 


Ser Leu 


Arg Arg 


He Thr Arg 


Met 


Val 


Leu 








180 








185 




190 






Val 


Val 


Val 


Gly 


Ala 


Phe 


Val Val 


Cys Trp 


Ala Pro He 


His 


He 


Phe 






195 








200 




205 








Val 


lie 


Val 


Trp 


Thr 


Leu 


Val Asp 


He Asp 


Arg Arg Asp 


Pro 


Leu 


Val 




210 










215 




220 








Val 


Ala 


Ala 


Leu 


















225 

























<210> 26 
<211> 372 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note =. 
synthetic construct 



<400> 26 



Met 


Glu 


Pro 


Val 


Pro 


Ser 


Ala 


Arg 


Ala 


Glu 


Leu 


Gin 


Phe 


Ser 


Leu 


Leu 


1 








5 










10 










15 




Ala 


Asn 


Val 


Ser 
20 


Asp 


Thr 


Phe 


Pro 


Ser 
25 


Ala 


Phe 


Pro 


Ser 


Ala 
30 


Ser 


Ala 


Asn 


Ala 


Ser 
35 


Gly 


3er 


Pro 


Gly 


Ala 
40 


Arg 


Ser 


Ala 


Ser 


Ser 
45 


Leu 


Ala 


Leu 


Ala 


He 


Ala 


He 


Thr 


Ala 


Leu 


Tyr 


Ser 


Ala 


Val 


Cys Ala 


Val 


Gly Leu 




50 










55 










60 










Leu 


Gly 


Asn 


Val 


Leu 


Val 


Met 


Phe 


Gly 


He 


Val 


Arg Tyr 


Thr Lys 


Leu 


65 










70 










75 










80 


Lys 


Thr 


Ala 


Thr 


Asn 


He 


Tyr 


He 


Phe 


Asn 


Leu 


Ala 


Leu 


Ala Asp Ala 










85 










90 










95 




Leu 


Ala 


Thr 


Ser 
100 


Thr 


Leu 


Pro 


Phe 


Gin 
105 


Ser 


Ala 


Lys 


Tyr 


Leu 
110 


Met 


Glu 


Thr 


Trp 


Pro 
115 


Phe 


Gly 


Glu 


Leu 


Leu 
120 


Cys 


Lys 


Ala 


Val 


Leu 
125 


Ser 


He 


Asp 


Tyr 


Tyr 
130 


Asn 


Met 


Phe 


Thr 


Ser 
135 


He 


Phe 


Thr 


Leu 


Thr 
140 


Met 


Met 


Ser 


Val 


Asp 


Arg 


Tyr 


He 


Ala 


Val 


Cys 


His 


Pro 


Val 


Lys 


Ala 


Leu 


Asp 


Phe 


Arg 



145 150 155 160 
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Thr 


Pro 


Ala Lys Ala 
165 


Lys 


Leu He Asn lie Cys He 
170 


Trp 


Val 


Leu 

1 /D 


Ala 


Ser 


Gly 


Val Gly Val 


Pro 


lie Met val Met Aia vai 


inr 


bin 


Pro 


Arg 




180 




185 










Asp 


Gly 


Ala Val Val 


Cys 


Thr Leu Gin Phe Pro Ser 


Pro 


Ser 


Trp 


Tyr 


195 




200 


205 








Trp 


Asp 
210 


Thr Val Thr 


Lys 


lie cys va.i Fne i»eu pne 
215 220 


i\±cL 




Vdl 


Val 


Pro 


He 


Leu He He 


Thr 


Val Cys Tyr Gly Leu Met 


Leu 


Leu 


Arg 


Leu 


225 






230 


235 . 








240 


Arg 


Ser 


Val Arg Leu 


Leu 


Ser Gly Ser Lys Glu Lys 


Asp 


Arg 


Ser 


Leu 




245 




250 






255 




Arg 


Arg 


He Thr Arg 


Met 


Val Leu Val Val Val Gly 


Ala 


Pne 


Val 


val 


260 




265 




270 






Cys 


Trp 


Ala Pro He 


His 


He Phe Val He Val Trp 


Thr 


Leu 


Val 


Asp 


275 




280 


285 








He 


Asn 
290 


Arg Arg Asp 


Pro 


Leu Val Val Ala Ala Leu 
295 300 


HIS 


Leu 


Cys 


lie 


Ala 


Leu 


Gly Tyr Ala 


Asn 


Ser Ser Leu Asn Pro Val 


Leu 


Tyr 


Aia 


pne 


305 




310 


315 








jiZV 


Leu 


Asp 


Glu Asn Phe 

325 


Lys 


Arg Cys Phe Arg Gin Leu 
330 


Cys 


Arg 


Ala 

QIC 
J .3D 


Pro 


Cys 


Gly 


Gly Gin Glu 


Pro 


Gly Ser Leu Arg Arg Pro 


Arg 


Gin 


Ala 


Thr 




• 340 




345 




350 






Ala 


Arg 


Glu Arg Val 


Thr 


Ala Cys Thr Pro Ser Asp 


Gly 


Pro 


Gly 


Gly 




355 




360 


365 








Gly 


Ala 
370 


Ala- Ala 















<210> 27 
<211> 372 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 27 


























Met 


Glu 


Leu 


Val 


Pro 


Ser 


Ala 


Arg Ala 


Glu 


Leu 


Gin 


Ser 


Ser 


Pro 


Leu 


1 








5 








10 










15 




Val 


Asn 


Leu 


Ser 


Asp 


Ala 


Phe 


Pro Ser 


Ala 


Phe 


Pro 


Ser 


Ala 


Gly 


Ala 








20 








25 










30 






Asn 


Ala 


Ser 


Gly 


Ser 


Pro 


Gly 


Ala Arg 


Ser 


Ala 


Ser 


Ser 


Leu 


Ala 


Leu 






35 










40 








45 








Ala 


He 


Ala 


He 


Thr 


Ala 


Leu 


Tyr Ser 


Ala 


Val 


Cys 


Ala 


Val 


Gly 


Leu 




50 










55 








60 










Leu 


Gly 


Asn 


Val 


Leu 


Val 


Met 


Phe Gly 


He 


Val 


Arg 


Tyr 


Thr 


Lys 


Leu 


65 








70 








75 










80 


Lys 


Thr 


Ala 


Thr 


Asn 


He 


Tyr 


He Phe 


Asn 


Leu 


Ala 


Leu 


Ala 


Asp 


Ala 








85 








90 










95 




Leu 


Ala 


Thr 


Ser 


Thr 


Leu 


Pro 


Phe Gin 


Ser 


Ala 


Lys 


Tyr 


Leu 


Met 


Glu 








100 








105 










110 






Thr 


Trp 


Pro 


Phe 


Gly 


Glu 


Leu 


Leu Cys 


Lys 


Ala 


Val 


Leu 


Ser 


He 


Asp 




115 










120 








125 








Tyr 


Tyr 


Asn 


Met 


Phe 


Thr 


Ser 


He Phe 


Thr 


Leu 


Thr 


Met 


Met 


Ser 


Val 


130 










135 








140 










Asp 


Arg 


Tyr 


He 


Ala 


Val 


Cys 


His Pro 


Val 


Lys 


Ala 


Leu 


Asp 


Phe 


Arg 


145 








150 








155 










160 
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Thr 


Pro 


Ala 


Lys 


Ala 
165 


Lys 


Leu 


He 


Asn 


He 
170 


Cys 


He 


Trp Val Leu Ala 
175 


Ser 


Gly 


Val 


Gly 


Val 


Pro 


He 


Met 


Val 


Met 


Ala 


Val 


Thr Gin Pro Arg 






180 










185 








190 


Asp 


Gly 


Ala 


Val 


Val 


Cys 


Met 


Leu 


Gin 


Phe 


Pro 


Ser 


Pro Ser Trp Tyr 




195 










200 










205 


Trp 


Asp 
210 


Thr 


Val 


Thr 


Lys 


lie 
215 


Cys 


Val 


Phe 


Leu 


Phe 
220 


Ala Phe Val Val 


Pro 


He 


Leu 


He 


He 


Thr 


Val 


Cys 


Tyr 


Gly 


Leu 


Met 


Leu Leu Arg Leu 


225 










230 










235 




240 


Arg 


Ser 


Val 


Arg 


Leu 
245 


Leu 


Ser 


Gly 


Ser 


Lys 
250 


Glu 


Lys 


Asp Arg Ser Leu 
255 


Arg 


Arg 


He 


Thr 
260 


Arg 


Met 


Vai 


Leu 


Val 
265 


Val 


Val 


Gly 


Ala Phe Val Val 
270 


Cys 


Trp 


Ala 


Pro 


He 


His 


He 


Phe 


Val 


He 


Val 


Trp 


Thr Leu Val Asp 


275 










280 










285 


He 


Asn 
290 


Arg 


Arg 


Asp 


Pro 


Leu 
295 


Val 


Val 


Ala 


Ala 


Leu 
300 


His Leu Cys He 


Ala 


Leu 


Gly 


Xyr 


Ala 


Asn 


Ser 


Ser 


Leu 


Asn 


Pro 


Val 


Leu Tyr Ala Phe 


305 










310 










315 




320 


Leu 


Asp 


Glu 


Asn 


Phe 


Lys Arg 


Cys 


Phe 


Arg 


Gin 


Leu 


Cys Arg Thr Pro 










325 










330 






335 


Cys 


Gly 


Arg 


Gin 


Glu 


Pro Gly 


Ser 


Leu 


Arg 


Arg 


Pro 


Arg Gin Ala Thr 








340 










345 








• •350 


Thr 


Arg 


Glu 
355 


Arg 


Val 


Thr 


Ala 


Cys 
360 


Thr 


Pro 


Ser 


Asp 


Gly Pro Gly Gly 
365 


Gly 


Ala 
370 


Ala 


Ala 





















<210> 28 
<211> 2219 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 28 

ctctaaaggc tgggtccctg cgcccagggc gcacggtgga gacggacacg gcggcgccat 60 
ggagctggtg ccctctgccc gtgcggagct gcagtcctcg cccctcgtca acctctcgga 120 
cgcctttccc agcgccttcc ccagcgcggg cgccaatgcg tcggggtcgc cgggagcccg 180 
tagtgcctcg tccctcgccc tagccatcgc catcaccgcg ctctactcgg ctgtgtgcgc 240 
agtggggctt ctgggcaacg tgctcgtcat gtttggcatc gtccggtaca ccaaattgaa 300 
gaccgccacc aacatctaca tcttcaatct ggctttggct gatgcgctgg ccaccagcac 360 
gctgcccttc cagagcgcca agtacttgat ggaaacgtgg ccgtttggcg agctgctgtg 420 
caaggctgtg ctctccattg actactacaa catgttcact agcatcttca ccctcaccat 480 
gatgagcgtg gaccgctaca ttgctgtctg ccatcctgtc aaagccctgg acttccggac 540 
accagccaag gccaagctga tcaatatatg catctgggtc ttggcttcag gtgtcggggt 600 
ccccatcatg gtcatggcag tgacccaacc ccgggatggt gcagtggtat gcatgctcca 660 
gttccccagt cccagctggt actgggacac tgtgaccaag atctgcgtgt tcctctttgc 720 
cttcgtggtg ccgatcctca tcatcacggt gtgctatggc ctcatgctac tgcgcctgcg 780 
cagcgtgcgt ctgctgtccg gttccaagga gaaggaccgc agcctgcggc gcatcacgcg 840 
catggtgctg gtggtggtgg gcgccttcgt ggtgtgctgg gcgcccatcc acatcttcgt 900 
catcgtctgg acgctggtgg acatcaatcg gcgcgaccca cttgtggtgg ccgcactgca 960 
cctgtgcatt gcgctgggct acgccaacag cagcctcaac ccggttctct acgccttcct 1020 
ggacgagaac ttcaagcgct gcttccgcca gctctgtcgc acgccctgcg gccgccaaga 1080 
acccggcagt ctccgtcgtc cccgccaggc caccacgcgt gagcgtgtca ctgcctgcac 1140 
cccctccgac ggcccgggcg gtggcgctgc cgcctgacct acccgacctt ccccttaaac 1200 
gcccctccca agtgaagtga tccagaggcc acaccgagct ccctgggagg ctgtggccac 1260 
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caccaggaca gctagaattg ggcctgcaca gaggggaggc ctcctgtggg gacggggcct 1320 
gagggatcaa aggctccagg ttggaacggt gggggtgagg aagcagagct ggtgattcct 1380 
aaactgtatc cattagtaag gcctctccaa tgggacagag cctccgcctt gagataacat 1440 
cgggttctgg cctttttgaa cacccagctc cagtccaaga cccaaggatt ccagctccag 1500 
gaaccaggag gggcagtgat ggggtcgatg atttggtttg gctgagagtc ccagcatttg 1560 
tgttatgggg aggatctctc atcttagaga agataagggg acagggcatt caggcaaggc 1620 
agcttggggt ttggtcagga gataagcgcc cccttccctt ggggggagga taagtggggg 1680 
atggtcaacg ttggagaaga gtcaaagttc tcaccacctt tctaactact cagctaaact 1740 
cgttgaggct agggccaacg tgacttctct gtagagagga tacaagccgg gcctgatggg 1800 
gcaggcctgt gtaatcccag tcatagtgga ggctgaggct ggaaaattaa ggaccaacag 1860 
cctgggcaat ttagtgtctc aaaataaaat gtaaagaggg ctgggaatgt agctcagtgg 1920 
tagggtgttt gtgtgaggct ctgggatcaa taagacaaaa caaccaacca accaaaaacc 1980 
ttccaaacaa caaaaccaac cctcaaacca aaaaactatg tgggtgtctc tgagtctggt 2040 
ttgaagagaa cccgcagccc tgtatccctg tggggctgtg gacagtgggc agaagcagag 2100 
gctccctgga tcctgaacaa gggccccaaa agcaagttct aaagggaccc ctgaaaccga 2160 
gtaagccttt gtgtcaagaa gtgggagtac aaccagaaag gtggctgagt gctttagag 2219 

<210> 29 
<211> 380 
<212> PRT 

<213> Artificial Sequence 
<22p> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 29' 

Met Asp Ser Pro He Gin He Phe Arg Gly Glu Pro Gly Pro Thr Cys 

1 5 io ■ * 15 

Ala Pro Ser Ala Cys Leu Pro Pro Asn Ser Ser Ala Trp Phe Pro Gly 

20. .25 30 

Trp Ala Glu Pro Asp Ser Asn Gly Ser Ala Gly Ser Glu Asp Ala Gin ' 

35 40 45 

Leu Glu Pro Ala His He Ser Pro Ala He Pro Val He He Thr Ala 

50 55 60 

Val Tyr Ser Val Val Phe Val Val Gly Leu Val Gly Asn Ser Leu Val 
65 70 75 80 

Met. Phe Val He lie Arg Tyr Thr Lys Met Lys Thr Ala Thr Asn lie 

85 90 95 

Tyr He Phe Asn Leu Ala Leu Ala Asp Ala Leu Val Thr Thr Thr Met 

100 105 110 

Pro Phe Gin Ser Thr Val Tyr Leu Met Asn Ser Trp Pro Phe Gly Asp 

115 120 125 

Val Leu Cys Lys He Val He Ser He Asp Tyr Tyr Asn Met Phe Thr 

130 135 140 

Ser He Phe Thr Leu Thr Met Met Ser Val Asp Arg Tyr He Ala Val 
145 150 155 ~ 160 

Cys His Pro Val Lys Ala Leu Asp Phe Arg Thr Pro Leu Lys Ala Lys 

165. 170 ^ 175 

He He Asn He Cys He Trp Leu Leu Ser Ser Ser Val Gly He Ser 

180 185 190 

Ala He Val Leu Gly Gly Thr Lys Val Arg Glu Asp Val Asp Val He 

195 200 205 

Glu Cys Ser Leu Gin Phe Pro Asp Asp Asp Tyr Ser Trp Trp Asp Leu 

210 215 220 

Phe Met Lys He Cys Val Phe He Phe Ala Phe Val He Pro Val Leu 
225 230 235 240 

lie lie lie Val Cys Tyr Thr Leu Met He Leu Arg Leu Lys Ser Val 

245 250 255 

Arg Leu Leu Ser Gly Ser Arg Glu Lys Asp Arg Asn Leu Arg Arg He 
260 265 . • 270 
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Thr Arg Leu Val Leu Val Val Val 
275 280 
Pro He His He Phe He Leu Val 

290 295 
Ser Thr Ala Ala Leu Ser Ser Tyr 
305 310 
Thr Asn Ser Ser Leu Asn Pro He 
325 

Phe Lys Arg Cys Phe Arg Asp Phe 
340 

Glu Arg Gin Ser Thr Ser Arg Val 
355 360 
Tyr Leu Arg Asp He Asp Gly Met 
370 375 



Ala Val Phe Val. Val Cys Trp Thr 
285 

Glu Ala Leu Gly Ser Thr Ser His 
300 

Tyr Phe Cys He Ala Leu Gly Tyr 
315 320 
Leu Tyr Ala Phe Leu Asp Glu Asn 

330 335 
Cys Phe Pro Leu Lys Met Arg Met 
345 350 
Arg Asn Thr Val Gin Asp Pro Ala 
365 

Asn Lys Pro Val 
380 



<210> 30 
<211> 1154 • 
<212> DNA 

<213> Artificial Sequence . 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 30 

atggactccc cgatccagat cttccgcggg gagccgggcc ctacetgcgc -cccgagcgcc 60 
tgcctgcccc ccaacagcag cgcctggttt cccggctggg ccgagcccga cagcaacggc 120 
agcgccggct cggaggacgc gcagctggag cccgcgcaca tctccccggc catcccggtc 180 
atcatcacgg cggtctactc cgtagtgttc gtcgtgggct tggtgggcaa ctcgctggtc 240 
atgttcgtga tcatccgata cacaaagatg aagacagcaa ccaacattta catatttaac 300 
ctggctttgg cagatgcttt agttactaca accatgccct ttcagagtac ggtctacttg 360 
atgaattcct ggccttttgg ggatgtgctg tgcaagatag taatttccat tgattactac 420 
aacatgttca ccagcatctt caccttgacc atgatgagcg tggaccgcta cattgccgtg 480 
tgccaccccg tgaaggcttt ggacttccgc acacccttga aggcaaagat catcaatatc 540 
tgcatctggc tgctgtcgtc atctgttggc atctctgcaa tagtccttgg aggcaccaaa 600 
gtcagggaag acgtcgatgt cattgagtgc tccttgcagt tcccagatga tgactactcc 660 
tggtgggacc tcttcatgaa gatctgcgtc ttcatctttg ccttcgtgat ccctgtcctc 720 
atcatcatcg tctgctacac cctgatgatc ctgcgtctca agagcgtccg gctcctttct 780 
ggctcccgag agaaagatcg caacctgcgt aggatcacca gactggtcct ggtggtggtg 840 
gcagtcttcg tcgtctgctg gactcccatt cacatattca tcctggtgga ggctctgggg 900 
agcacctccc acagcacagc tgctctctcc agctattact tctgcatcgc cttaggctat 960 
accaacagta gcctgaatcc cattctctac gcctttcttg atgaaaactt caagcggtgt 1020 
ttccgggact tctgctttcc actgaagatg aggatggagc ggcagagcac tagcagagtc 1080 
cgaaatacag ttcaggatcc tgcttacctg agggacatcg atgggatgaa taaaccagta 1140 
tgactagtcg tgga 1154 

<210> 31 
<211> 380 
<212> PRT 

<213> Artificial Sequence 

<220> ^ 
<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 31 

Met Glu Ser Pro He Gin Tie Phe Arg Gly Asp Pro Gly Pro Thr Cys 

15 io 15 

Ser Pro Ser Ala Cys Leu Leu Pro Asn Ser Ser Ser Trp Phe Pro Asn 
20 25 .30 
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XI -P 


Ala 


Glu 

VJl u. 


oei x-\.£ap oei rio xx 
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Ser 


Glu 


Asp 
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35 




40 










45 








Leu 


Glu 


Ser 


Ala HiR Tie Ser 


Pro 


Ala 


lie 


Pro 


Val 


lie 


lie 


Thr 


Ala 




50 
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VaJ. 











370 375 380 



<210> 32 
<211> 1410 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note a 
synthetic construct 

<400> 32 

gcgcaccttg ctgatcccaa acaggcagag cttcttccag tcttggaagg cacaaattga 60 
gcatcaggaa cgtggaccca tcagggctga acagctactc aggatctaaa gtggtgactt 120 
ggaaagctga cggtgacttg ggaagggagg tcgccaatca gcgatctgga gctgcagcgc 180 
tcaccatgga gtcccccatt cagatcttcc gaggagatcc aggccctacc tgctctccca 240 
gtgettgect tctccccaac agcagctctt ggttccccaa ctgggcagaa tccgacagta 300 
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atggcagtgt gggctcagag gatcagcagc tggagtccgc gcacatctct ccggccatcc 360 
ctgttatcat caccgctgtc tactctgtgg tatttgtggt gggcttagtg ggcaattctc 420 
tggtcatgtt tgtcatcatc cgatacacga agatgaagac cgcaaccaac atctacatat 480 
ttaacctggc tttggcagat gctttggtta ctaccactat gccctttcag agtgctgtct 540 
acttgatgaa ttcttggcct tttggagatg tgctatgcaa gattgtcatt tccattgact 600 
actacaacat gtttaccagc atattcacct tgaccatgat gagtgtggac cgctacattg 660 
ctgtgtgcca ccctgtgaaa gctttggact tccgaacacc tttgaaagca aagatcatca 720 
acatctgcat ttggctcctg gcatcatctg ttggtatatc agcgatagtc cttggaggca 780 
ccaaagtcag ggaagatgtg gatgtcattg aatgctcctt gcagtttcct gatgatgaat 840 
attcctggtg ggatctcttc atgaagatct gtgtcttcgt ctttgccttt gtgatcccag 900 
tcctcatcat cattgtctgc tacaccctga tgatcctgcg cct'gaagagt gtccggctcc 960 
tgtctggctc ccgagagaag gaccgaaatc tccgccgcat caccaagctg gtgctggtag 1020 
tagttgcagt cttcatcatc tgttggaccc ccattcacat ctttatcctg gtggaggctc 1080 
tgggaagcac ctcccacagc acagctgccc tctccagcta ttatttctgt attgccttgg 1140 
gttataccaa cagcagcctg aatcctgttc tctatgcctt tctggatgaa aacttcaagc 1200 
ggtgttttag ggacttctgc ttccctatta agatgcgaat ggagcgccag agcaccaata 1260 
gagttagaaa cacagttcag -gatcctgctt ccatgagaga tgtgggaggg atgaataagc 1320 
cagtatgact agtcgtggaa atgtcttctt attgttctcc aggtagagaa gagttcaatg 1380 
atcttggttt aacccagatt acaactgcag 1410 

<210> 33 • 

<211> 380 ... 

<212> PRT . . , 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<4.00> 33 

Met Glu Ser Pro lie Gin lie- Phe Arg Gly Glu Pro Gly Pro Thr Cys 

1 5 10 15 

Ala Pro Ser Ala Cys Leu Leu Pro Asn Ser Ser Ser Trp .Phe Pro Asn 

20 25 * 30 

Trp Ala Glu Ser Asp Ser Asn Gly Ser Val Gly Ser Glu Asp Gin Gin 

35 40 45 

Leu Glu Pro Ala His lie Ser Pro Ala lie Pro Val lie He Thr Ala 

50 55 60 

Val Tyr Ser Val Val Phe Val Val Gly Leu Val Gly Asn Ser Leu Val 
65 70 75 80 

Met Phe Val lie He Arg Tyr Thr Lys Met Lys Thr Ala Thr Asn He 

85 90 95 

Tyr He Phe Asn Leu Ala Leu Ala Asp Ala Leu Val Thr Thr Thr Met - 

100 105 no 

Pro Phe Gin Ser Ala Val Tyr Leu Met Asn Ser Trp Pro Phe Gly Asp 

115 120 125 

Val Leu Cys Lys He Val He Ser He Asp Tyr Tyr Asn Met Phe Thr 

130 135 140 

Ser He Phe Thr Leu Thr Met Met Ser Val Asp Arg Tyr He Ala Val 
145 150 155 160 

Cys His Pro Val Lys Ala Leu Asp Phe Arg Thr Pro Leu Lys Ala Lys 

165 170 175 

He He Asn He Cys He Trp Leu Leu Ala Ser Ser Val Gly He Ser 

180 185 190 

Ala He Val Leu Gly Gly Thr Lys Val Arg Glu Asp Val Asp Val He 

195 200 205 

Glu Cys Ser Leu Gin Phe Pro Asp Asp Glu Tyr Ser Trp Trp Asp Leu 

210 215 220 

Phe Met Lys He Cys Val Phe Val Phe Ala Phe Val He Pro Val Leu 
225 230 235 240 
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He 
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330 






335 
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Met 
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350 
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Pro 
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365 



Asn Lys Pro Val 
380 



<210> 34 
<211> 2481 
<212> DNA 

<2.13> Artificial- Sequence 
<220>' ' 

<223> Description of Artificial Sequence : /note- - 
synthetic construct 

<4fl0> 34 

tccagccctg cctgcacagg caaagtttgt ctctctgggt ccaatctgtc ctctgctcct 60 

gcagcccggc aagtgccatc gctcctcggt ttccagctgc agcactcacc atggagtccc 120 

ccatccagat tttccgcgga gagccaggcc ctacctgtgc tcccagtgct tgcctactcc 180 

ccaacagcag ctcttggttc cccaactggg ccgaatcgga cagcaatggc agtttgggct 240 

ccgaggacca gcagctggag cccgcgcaca tctctccagc catccctgtt atcatcaccg 300 

ctgtctactc tgtggtgttt gtggtgggct tagtgggcaa ttccctggtc atgtttgtca 360 

tcatccgata cacaaagatg aagaccgcaa ccaacatcta catatttaac ctggctttgg 420 

cagatgcttt ggttactacc actatgccct tccagagtgc tgtctacttg atgaattctt 480 

ggccttttgg agatgttctg tgcaagattg tcatttccat tgactactac aacatgttta 540 

ccagcatatt caccttgacc atgatgagtg tggaccgcta cattgccgtg tgccaccctg 600 

tgaaagcttt ggatttccga acacctttga aagcaaagat catcaacatc tgcatttggc 660 

tactggcatc atctgttggt atatcagcga tagtccttgg aggcaccaaa * gtcagggaag 720 

atgtggatgt cattgaatgc tccttgcagt ttcctgatga tgaatattcc tggtgggacc 780 

tcttcatgaa gatctgtgtc ttcgtctttg cctttgttat ccctgtctta atcatcattg 840 

tctgctacac cctgatgatc ctgcgcttga agagtgtccg gctcctctcg ggctctcgag 900 

agaaggaccg aaatctccgc cggatcacca agctggtgct ggtagtggtt gcagtcttca 960 

tcatctgttg gacccccatc cacatcttta tcctggtcga ggctctaggc agcacctctc 1020 

acagcacagc tgtcctctct agctattact tctgcattgc cttgggttat accaacagca 1080 

gcttgaatcc tgttctctat gcctttcttg atgaaaactt caagcggtgt tttagggact 1140 

tctgcttccc cattaagatg cgaatggagc gccagagcac aaacagagtt agaaacacag 1200 

ttcaggatcc tgcttccatg agggatgtgg gtgggatgaa taagccagta tgactagtca 1260 

tggaaatgtc ttcctattgt tctccgggta gagaagagtt caatgatctt ggtttaaccc 1320 

agattaccac tgcagtctga agaggaaaga tgaggtattc aataacttag ccatgttatg 1380 

caatctaaag gtgcagggca cattagtgat ctaggctgag taggggcagc aagtgtgaag 1440 

aacagagcac atgtcctggc aacaatacac ctctttccta ggacagagga gaaggcaatc 1500 

taacctcaac ccttcgataa acagacagca ctctttcttc tggtcccctt gatttactgc 1560 

acctccatct gcgtggcctt tctgtgtaac atagttccaa agctctagag aagaaaatga 1620 

aagaaaaagt gcatttgatc caaaacttac tgggcaccca accttcgcgt taacacaggc 1680 

aaaaccaact tcgtatacaa tgaggccata ccaagggtca tatccaactc actttctgct 1740 

ggtgatcatg tcttatgtgt gatgagagtg aaacttagga agaacctgga ctcagaccct 1800 

gacactgggg gagagcacca tattgacatt tgtgaaccta tttaaagttg tgggtgttct 1860 
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cgtctcacag tgtctaatgc cttgaaaaac tacagttgct tcttaaggtc tctggttttt 1920 
agcatgctat tcaggaagat aatcttctga gaaaacatga actgatatta aaaggttgaa 1980 
gcttaatacc agcaaagtgt gtgtaatttc atctgtaaat agtggtctgt atataaataa 2040 
ggaccaggtt ttcctgtcca gcctgtacat ttctcaagga tgccgtagac acacccctgg 2100 
aggcatggaa agttcatgct gggatatttt gcttcactat aagctacttt cttgatttgg 2160 
tcttggtgtg atttctacta gattactcaa acattattta ctctaacact gatcataact 2220 
tggtgttaac aattccccaa actttgaatt cattctaaag tgttagcatt gatcaaatct 2280 
actttgtggt agcatctgtt tgtaaacaca cacatattgc cagattctct actcaggtag 2340 
aggaagttgc tttgatcatg tacaccttca aatgttatgq tctggctttc cacagaaagt 2400 
ggaattgttt caaaatgcat gctgaaaaag gaaataggat ttgagatggc ttagcacaat 2460 
ttgcatggta ttgagtaaga g 2481 

<210> 35 
<211> 21 
<212> DNA 

<213> Artificial Sequence- 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 35 

tttttccagt tccgtttatc c -21 ■ 

<210> 36 . 

<211> 20 . 
<212> DNA 

<213> Artificial Sequence 
<220>. 

<223> Description of Artificial Sequence : /note t> 
synthetic construct 

<400> 36 

tttatcgcca atccacatct 20 

<210> 37 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> .... 

<223> Description of Artificial Sequence : /note « 
synthetic construct 

<400> 37 

cccatagtaa cgccaatagg 20 

<210> 38 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note « 
synthetic construct 

<400> 38 

aaatgtgagc gagtaacaac c 21 
<210> 39 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 39 

accacagtcc atgccatcac 20 

<210> 40 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<210> 41 

<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 41 

cacctaatac gactcactat agg 23 

<210> 42 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note « 
synthetic construct 



<210> 43 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note » 
synthetic construct 

<400> 43 

atatcgatat cgctagcttt taaaagaaaa ggggg 35 



<400> 40 

tccaccaccc tgttgctgta 



20 



<400> 42 

cattaaccct cactaaag 



18 



<210> 44 



41/53 



WO 2004/073646 



PCT/US2004/004914 



<211> 39 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 44 

taatcgatgc taagcaaaat tttgaatttt tgtaatttg 39 

<210> 45 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note » 

synthetic construct 
<400> 45 

gaattaccta atgggaacat gg 22 

<210> 46 

<211> 18 

<212> DNA • - 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : /note » 
synthetic construct 



<400> 46 

gcagacgatg aacacagc 18 

<210> 47 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 47 

accacagcaa tcac 14 



<210> 48 
<211> 10472 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 48 

aatgtagtct tatgcaatac tcttgtagtc ttgcaacatg gtaacgatga gttagcaaca 60 
tgccttacaa. ggagagaaaa agcaccgtgc atgccgattg gtggaagtaa ggtggtacga 120 
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tcgtgcctta ttaggaaggc aacagacggg tctgacatgg attggacgaa ccactgaatt 180 
gccgcattgc agagatattg tatttaagtg cctagctcga tacataaacg ggtctctctg 240 
gttagaccag atctgagcct gggagctctc tggctaacta gggaacccac tgcttaagcc 300 
tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg 360 
taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca gtggcgcccg 420 
aacagggact tgaaagcgaa agggaaacca gaggagctct ctcgacgcag gactcggctt 480 
gctgaagcgc gcacggcaag aggcgagggg cggcgactgg tgagtacgcc aaaaattttg 540 
actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa gcgggggaga 600 
attagatcgc gatgggaaaa aattcggtta aggccagggg gaaagaaaaa atataaatta 660 
aaacatatag tatgggcaag cagggagcta gaacgattcg cagttaatcc tggcctgtta 720 
gaaacatcag aaggctgtag acaaatactg ggacagctac aaccatccct tcagacagga 780 
tcagaagaac ttagatcatt atataataca gtagcaaccc tctattgtgt gcatcaaagg 840 
atagagataa aagacaccaa ggaagcttta gacaagatag aggaagagca aaacaaaagt 900 
aagaccaccg cacagcaagc ggccgctgat cttcagacct ggaggaggag atatgaggga 960 
caattggaga agtgaattat ataaatataa agtagtaaaa attgaaccat taggagtagc 1020 
acccaccaag gcaaagagaa gagtggtgca gagagaaaaa agagcagtgg gaataggagc 1080 
tttgttcctt gggttcttgg gagcagcagg aagcactatg ggcgcagcgt caatgacgct 1140 
gacggtacag gccagacaat tattgtctgg tatagtgcag cagcagaaca atttgctgag 1200 
ggctattgag gcgcaacagc atctgttgca actcacagtc tggggcatca agcagctcca 1260 
ggcaagaatc ctggctgtgg aaagatacct aaaggatcaa cagctcctgg ggatttgggg 1320 
ttgctctgga aaactcattt gcaccactgc tgtgccttgg aatgctagtt ggagtaataa 1380 
atctctggaa cagatttgga atcacacgac ctggatggag tgggacagag aaattaacaa 1440 
ttacacaagc ttaatacact ccttaattga agaatcgcaa aaccagcaag aaaagaatga 1500 
acaagaatta ttggaattag ataaatgggc aagtttgtgg aattggttta acataacaaa 1560 
ttggctgtgg tatataaaat tattcataat gatagtagga ggcttggtag gtttaagaat 1620 
agtttttgct gtactttcta tagtgaatag agttaggcag ggatattcac cattatcgtt 1680 
tcagacccac ctcccaaccc cgaggggace cgacaggccc gaaggaatag aagaagaagg 1740. 
tggagagaga gacagagaca gatccattcg attagtgaac ggatctcgac ggtatcgata 1800 
agcttgggag ttccgcgtta cataacttac ggtaaatggc ccgcctggct gaccgcccaa 1860 
cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc caatagggac 1920 
tttccattga cgtcaatggg tggagtattt acggtaaact gcccacttgg cagtacatca 1980 
agtgtatcat atgccaagta cgccccctat tgacgtcaat gacggtaaat ggcccgcctg 2040 
gcattatgcc cagtacatga ccttatggga ctttcctact tggcagtaca tctacgtatt 2100 
agtcatcgct attaccatgg tgatgcggtt ttggcagtac atcaatgggc gtggatagcg 2160 
gtttgactca cggggatttc caagtctcca ccccattgac gtcaatggga gtttgttttg 2220 
gcaccaaaat caacgggact ttccaaaatg tcgtaacaac tccgccccat tgacgcaaat 2280 
gggcggtagg cgtgtacggt gggaggtcta tataagcaga gctcgtttag tgaaccgtca 234 Oy, 
gatcgcctgg agacgccatc cacgctgttt tgacctccat agaagacacc gactctagag 2400 
gatccactag tccagtgtgg tggaattgat cccttcacct aatacgactc actataggct 2460 
agctcctcct ctgctcgccc aatccttcca accccctatg gtggtatggc tgacacagaa 2520 
aatgtctgct cctgtatggg acatttgccc ctcttctcca aatataagac aggatgaggc 2580 
ctagcttttg ctgctccaaa gttttaaaag aacacattgc acggcattta gggactctaa 2640 
agggtggagg aggaatgagg gaattgcatc atgccaaggc tggtcctcat ccatcactgc 2700 
ttccagggcc cagagtggct tccaggaggt attcttacaa aggaagcccg atctgtagct 2760 
aacactcaga gcccattttc ctgcgttaac ccctcccgac ctcatataca ggagtaacat 2820 
gatcagtgac ctgggggagc tggccaaact gcgggacctg cccaagctga gggccttggt 2880 
gctgctggac aacccctgtg ccgatgagac tgactaccgc caggaggccc tggtgcagat 2940 
ggcacaccta gagcgcctag acaaagagta ctatgaggac gaggaccggg cagaagctga 3 000 
ggagatccga cagaggctga aggaggaaca ggagcaagaa ctcgacccgg accaagacat 3060 
ggaaccgtac ctcccgccaa cttagtggct cctctagcct gcagggacag taaaggtgat 3120 
ggcaggaagg cagcccccgg aggtcaaagg ctgggcacgc gggaggagag gccagagtca 3180 
gaggctgcgg gtatctcaga tatgaaggaa agatgagaga ggctcaggaa gaggtaagaa 3240 
aagacacaag agaccagaga agggagaaga attagagagg gaggcagagg accgctgtct 3300 
ctacagacat agctggtaga gactgggagg aagggatgaa ccctgagcgc atgaagggaa 3360 
gg a 9gt99 ct ggtggtatat ggaggatgta gctgggccag ggaaaagatc ctgcactaaa 3420 
aatctgaagc taaaaataac aggacacggg gtggagaggc gaaaggaggg cagattgagg 3480 
cagagagact gagaggcctg gggatgtggg cattccggta gggcacacag ttcacttgtc 3540 
ttctcttttt ccaggaggcc aragatgctg acctcaagaa ctcataatac cccagtgggg 3600 
accaccgcat tcatagccct gttacaagaa gtgggagatg ttcctttttg tcccagactg 3660 
gaaatccatt acatcccgag gctcaggttc tgtggtggtc atctctgtgt ggcttgttct 3720 
gtgggcctac ctaaagtcct aagcacagct ctcaagcaga tccgaggcga ctaagatgct 3780 
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agtaggggtt gtctggagag aagagccgag gaggtgggct gtgatggatc agttcagctt 3840 
tcaaataaaa aggcgttttt atattctgtg tcgagttcgt gaacccctgt ggtgggcttc 3900 
tccatctgtc tgggttagta cctgccacta tactggaata aggggacgcc tgcttccctc 3960 
gagttggctg gacaaggtta tgagcatccg tgtacttatg gggttgccag cttggtcctg 4020 
gatcgcccgg gcccttcccc cacccgttcg gttccccacc accacccgcg ctcgtacgtg 4080 
cgtctccgcc tgcagctctt gactcatcgg ggcccccggg tcacatgcgc tcgctcggct 4140 
ctataggcgc cgccccctgc ccaccccccg cccgcgctgg gagccgcagc cgccgccact 4200 
cctgctctct ctgcgccgcc gccgtcacca ccgccaccgc caccggctga gtctgcagtc 4260 
ctcgaggaac tgaaaaacca gaaagttaac tggtaagttt agtctttttg tcttttattt 4320 
caggtcccgg atcfcggtggt ggtgcaaatc aaagaactgc tcctcagtgg atgttgcctt 4380 
tacttctagg cctgtacgga agtgttactt ctgctctaaa agctgcggaa ttgtacccgc 4440 
ggccaagcta agcttgatat cgaattccgg atgagcctct gtgaactact aaggtgggag 4500 
ggggctatac gcagaggaga atgtcagatg ctcagctcgg tcccctccgc ctgacgctcc 4560 
tctctgtctc agccaggact ggtttctgta agaaacagca ggagctgtgg cagcggcgaa 4620 
aggaagcggc tgaggcgctt ggaacccgaa aagtctcggt gctcctggct acctcgcaca 4680 
gcggtgcccg cccggccgtc agtaccatgg acagcagcgc tgcccccacg aacgccagca 4740 
attgcactga tgccttggcg tactcaagtt gctccccagc acccagcccc ggttcctggg 4800 
tcaacttgtc ccacttagat ggcaacctgt ccgacccatg cggtccgaac cgcaccgacc 4 860 
tgggcgggag agacagcctg tgccctccga ccggcagtcc ctccatgatc acggccatca 4920 
cgatcatggc cctctactcc atcgtgtgcg tggtggggct cttcggaaac ttcctggtca 4980 
tgtatgtgat tgtcagatac accaagatga agactgccac caacatctac attttcaacc 5040 
ttgctctggc agatgcctta gccaccagta ccctgccctt ccagagtgtg aattacctaa 5100 
tgggaacatg gccatttgga accatccttt gcaagatagt gatctccata gattactata 5160 
acatgttcac cagcatattc accctctgca ccatgagtgt tgatcgatac attgcagtct 5220 
gccaccctgt caaggcctta gatttccgta ctccccgaaa tgccaaaatt atcaatgtct 5280 
gcaactggat cctctcttca gccattggtc ttcctgtaat gttcatggct acaacaaaat 5340 
acaggcaagg ttccatagat tgtacactaa cattctctca tccaacctgg tactgggaaa 5400 
acctgctgaa gatctgtgtt ttcatcttcg ccttcattat gccagtgctc atcattaccg 5460 
tgtgctatgg actgatgatc ttgcgcctca agagtgtccg catgctctct ggctccaaag 5520 
aaaaggacag gaatcttcga aggatcacca ggatggtgct ggtggtggtg gctgtgttca 5580 
tcgtctgctg gactcccatt cacatttacg tcatcattaa agccttggtt acaatcccag 5640 
aaactacgtt ccagactgtt tcttggcact tctgcattgc tctaggttac acaaacagct 5700 
gcctcaaccc agtcctttat gcatttctgg atgaaaactt caaacgatgc ttcagagagt 5760 
tctgtatccc aacctcttcc aacattgagc aacaaaactc cactcgaatt cgtcagaaca 5820 
tctagagacc acccctccac ggccaataca gtggatagaa ctaatcatca gctagaaaat 5880 
ctggaagcag aaactgctcc gttgcccgtc gacccgggcg gccgcttccc tttagtgagg 5940. 
gttaatgaag ggctcgagtc tagagggccc gcggttcgaa ggtaagccta tccctaaccc 60OO : 
tctcctcggt ctcgattcta cgcgtaccgg ttagtaatga gtttggaatt aattctgtgg 6060 
aatgtgtgtc agttagggtg tggaaagtcc ccaggctccc caggcaggca gaagtatgca 6120 
aagcatgcat ctcaattagt cagcaaccag gtgtggaaag tccccaggct ccccagcagg 6180 
cagaagtatg caaagcatgc atctcaatta gtcagcaacc atagtcccgc ccctaactcc 6240 
gcccatcccg cccctaactc cgcccagttc cgcccattct ccgccccatg gctgactaat 6300 
tttttttatt tatgcagagg ccgaggccgc ctctgcctct gagctattcc agaagtagtg 6360 
aggaggcttt tttggaggcc taggcttttg caaaaagctc ccgggag'ctt gtatatccat 6420 
tttcggatct gatcagcacg tgttgacaat taatcatcgg catagtatat cggcatagta 6480 
taatacgaca aggtgaggaa ctaaaccatg gccaagcctt tgtctcaaga agaatccacc 6540 
ctcattgaaa gagcaacggc tacaatcaac agcatcccca tctctgaaga ctacagcgtc 6600 
gccagcgcag ctctctctag cgacggccgc atcttcactg gtgtcaatgt atatcatttt 6660 
actgggggac cttgtgcaga actcgtggtg ctgggcactg ctgctgctgc ggcagctggc 6720 
aacctgactt gtatcgtcgc gatcggaaat gagaacaggg gcatcttgag cccctgcgga 6780 
cggtgccgac aggtgcttct cgatctgcat cctgggatca aagccatagt gaaggacagt 6840 
gatggacagc cgacggcagt tgggattcgt gaattgctgc cctctggtta tgtgtgggag 6900 
ggctaagcac aattcgagct cggtaccttt aagaccaatg acttacaagg cagctgtaga 6960 
tcttagccac tttttaaaag aaaagggggg actggaaggg ctaattcact cccaacgaag 7020 
acaagatctg ctttttgctt gtactgggtc tctctggtta gaccagatct gagcctggga 7080 
gctctctggc taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct 7140 
tcaagtagtg tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt 7200 
ttagtcagtg tggaaaatct ctagcagtag tagttcatgt catcttatta ttcagtattt 7260 
ataacttgca aagaaatgaa tatcagagag tgagaggaac ttgtttattg cage t tat aa 7320 
tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt tttcactgea 7380 
ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catgtctggc tctagctatc 7440 
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ccgcccctaa ctccgcccat cccgccccta actccgccca gttccgccca ttctccgccc 7500 
catggctgac taattttttt tatttatgca gaggccgagg ccgcctcggc ctctgagcta 7560 
ttccagaagt agtgaggagg cttttttgga ggcctaggga cgtacccaat tcgccctata 7620 
gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca acgtcgtgac tgggaaaacc 7680 
ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata 7740 
gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggg 7800 
acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg 7860 
ctacacttgc cagcgcccta 1 gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca 7920 
cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta 7980 
gtgctttacg gcacctcgac cccaaaaaac ttgattaggg tgatggttca cgtagtgggc 8040 
catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg 8100 
gactcttgtt ccaaactgga acaacactca accctatctc ggtctattct tttgatttat 8160 
aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta 8220 
acgcgaattt taacaaaata ttaacgctta caatttaggt ggcacttttc ggggaaatgt 8280 
gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag 8340 
acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca 8400 
tttccgtgtc igcccttattc ccttttttgc ggca'ttttgc cttcctgttt fctgctcaccc 8460 
agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat 8520 
cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc 8580 
aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg 8640 
gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc 8700 
agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat 8760 
aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga 8820 
gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc 8880 
ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc 8940 
aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt 9000 
aatagactgg atggaggcgg ataaagttgc- aggaccaett ctgcgctcgg cccttccggc 9060 
tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc 9120 
agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca 9180 
ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca 9240 
ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa aacttcattt 9300 
ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta 9360 
acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg 9420 
agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc 9480. 
ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag 9540. 
cagagcgcag ataccaaata ctgttcttct agtgtagccg bagttaggcc accacttcaa 9600. 
gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc 9660 
cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc 9720 
gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta 9780 
caccgaactg agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag 9840 
aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct 9900 
tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga 9960 
gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc 1002a 
ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt 10080 
atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg 10140 
cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg 10200 
caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 10260 
cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc 10320 
accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata 10380 
acaatttcac acaggaaaca gctatgacca tgattacgcc aagcgcgcaa ttaaccctca 10440 
ctaaagggaa caaaagctgg agctgcaagc tt 10472 



<210> 49 
<211> 8889 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
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synthetic construct 
<400> 49 

ttggcccatt gcatacgttg tatccatatc 
caacattacc gccatgttga cattgattat 
ggtcattagt tcatagccca tatatggagt 
cgcctggctg accgcccaac gacccccgcc 
tagtaacgcc aatagggact ttccattgac 
cccacttggc agtacatcaa gtgtatcata 
acggtaaatg gcccgcctgg cattatgccc 
ggcagtacat ctacgtatta gtcatcgcta 
tcaatgggcg tggatagcgg tttgactcac 
tcaatgggag tttgttttgg caccaaaatc 
ccgccccatt gacgcaaatg ggcggtaggc 
ctcgtttagt gaaccgtcag atcgcctgga 
gaagacaccg ggaccgatcc agcctcccct 
cctgagaact tcagggtgag ■ tctatgggac 
tggttaagtt catgtcatag gaaggggaga 
gggtaatttt gcatttgtaa ttttaaaaaa 
tatcttattt ctaatacttt ccctaatctc 
tcatgcctct ttgcaccatt ctaaagaata 
caatatttct gcatataaat atttctgcat 
attgctaata gcagctacaa tccagctacc 
gctggattat tctgagtcca agctaggccc 
tcctcccaca gctcctgggc aacgtgctgg 
cacgtgagat ctgaattcga gatctgccgc 
cgggggagaa ttagatcgat gggaaaaaat 
taaattaaaa catatagtat gggcaagcag 
cctgttagaa acatcagaag gctgtagaca. 
gacaggatca gaagaactta gatcattata 
tcaaaggata gagataaaag acaccaagga 
caaaagtaag aaaaaagcac agcaagcagc 
ccaaaattac cctatagtgc agaacatcca 
tagaacttta aatgcatggg taaaagtagt 
acccatgttt tcagcattat cagaaggagc 
cacagtgggg ggacatcaag cagccatgca 
tgcagaatgg gatagagtgc atccagtgca 
agaaccaagg ggaagtgaca tagcaggaac 
gatgacacat aatccaccta tcccagtagg 
attaaataaa atagtaagaa tgtatagccc 
aaaggaaccc tttagagact atgtagaccg 
ttcacaagag gtaaaaaatt ggatgacaga 
ttgtaagact attttaaaag cattgggacc 
atgtcaggga gtggggggac ccggccataa 
agtaacaaat ccagctacca taatgataca 
tgttaagtgt ttcaattgtg gcaaagaagg 
gaaaaagggc tgttggaaat gtggaaagga 
acaggctaat tttttaggga agatctggcc 
tcagagcaga ccagagccaa cagccccacc 
aacaactccc tctcagaagc aggagccgat 
cagatcactc tttggcagcg acccctcgtc 
gctctattag atacaggagc agatgataca 
tggaaaccaa aaatgatagg gggaattgga 
atactcatag aaatctgcgg acataaagct 
gtcaacataa ttggaagaaa tctgttgact 
agtcctattg agactgtacc agtaaaatta 
caatggccat tgacagaaga aaaaataaaa 
aaggaaggaa aaatttcaaa aattgggcct 
ataaagaaaa aagacagtac taaatggaga 
agaactcaag atttctggga agttcaatta 
aaaaaatcag taacagtact ggatgtgggc 



ataatatgta catttatatt ggctcatgtc 60 
tgactagtta ttaatagtaa tcaattacgg 120 
tccgcgttac ataacttacg gtaaatggcc 180 
cattgacgtc aataatgacg tatgttccca 240 
gtcaatgggt ggagtattta cggtaaactg 300 
tgccaagtac gccccctatt gacgtcaatg 360 
agtacatgac cttatgggac tttcctactt 420 
ttaccatggt gatgcggttt tggcagtaca 480 
ggggatttcc aagtctccac cccattgacg 540 
aacgggactt tccaaaatgt cgtaacaact 600 
gtgtacggtg ggaggtctat ataagcagag 660 
gacgccatcc acgctgtttt gacctccata 720 
cgaagcttac atgtggtacc gagctcggat 780 
ccttgatgtt ttctttcccc ttcttttcta 840 
agtaacaggg tacacatatt gaccaaatca 900 
tgctttcttc ttttaatata cttttttgtt 960 
tttctttcag ggcaataatg atacaatgta 1020 
acagtgataa tttctgggtt aaggcaatag 1080 
ataaattgta actgatgtaa gaggtttcat 1140 
attctgcttt tattttatgg ttgggataag 1200 
ttttgctaat catgttcata cctcttatct 1260 
tctgtgtgct ggcccatcac tttggcaaag 1320 
cgccatgggt gcgagagcgt cagtattaag 1380 
tcggttaagg ccagggggaa. agaaaaaata 1440 
ggagctagaa cgattcgcag ttaatcctgg 1500 
aatactggga cagctacaac catcccttca 1560 
taatacagta gcaaccctct attgtgtgca 162 0 
agctttagac aagatagagg aagagcaaaa 168.0 
agctgacaca ggacacagca atcaggtcag 1740 
ggggcaaatg gtacatcagg ccatatcacc 1800 
agaagagaag gctttcagcc cagaagtgat 1860. 
caccccacaa gatttaaaca ccatg[ctaaa 192 0-. 
aatgttaaaa gagaccatca atgaggaagc 1980. 
tgcagggcct attgcaccag gccagatgag 2040. 
tactagtacc cttcaggaac aaataggatg 2100 
agaaatctat aaaagatgga taatcctggg 2160 
taccagcatt ctggacataa gacaaggacc 2220 
attctataaa actctaagag ccgagcaagc 2280 
aaccttgttg gtccaaaatg cgaacccaga 2340 
aggagcgaca ctagaagaaa tgatgacagc 2400 
agcaagagtt ttggctgaag caatgagcca 2460 
gaaaggcaat tttaggaacc aaagaaagac 2520 
gcacatagcc aaaaattgca gggcccctag 2580 
aggacaccaa atgaaagatt gtactgagag 2640 
ttcccacaag ggaaggccag ggaattttct 2700 
agaagagagc ttcaggtttg gggaagagac 2760 
agacaaggaa ctgtatcctt tagcttccct 2820 
acaataaaga taggggggca attaaaggaa 2880 
gtattagaag aaatgaattt gccaggaaga 2940 
ggttttatca aagtaagaca gtatgatcag 3000 
ataggtacag tattagtagg acctacacct 3060 
cagattggct gcactttaaa ttttcccatt 3120 
aagccaggaa tggatggccc aaaagttaaa 3180 
gcattagtag aaatttgtac agaaatggaa 3240 
gaaaatccat acaatactcc agtatttgcc 3300 
aaattagtag atttcagaga acttaataag* 3360 
ggaataccac atcctgcagg gttaaaacag 3420 
gatgcatatt tttcagttcc cttagataaa 3480 
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gacttcagga agtatactgc atttaccata cctagtataa acaatgagac accagggatt 3540 
agatatcagt acaatgtgct tccacaggga tggaaaggat caccagcaat attccagtgt 3600 
agcatgacaa aaatcttaga gccttttaga aaacaaaatc cagacatagt catctatcaa 3660 
tacatggatg atttgtatgt aggatctgac ttagaaatag ggcagcatag aacaaaaata 3720 
gaggaactga gacaacatct gttgaggtgg ggatttacca caccagacaa aaaacatcag 3780 
aaagaacctc cattcctttg gatgggttat gaactccatc ctgataaatg gacagtacag 3840 
cctatagtgc tgccagaaaa ggacagctgg actgtcaatg acatacagaa attagtggga 3900 
aaattgaatt gggcaagtca gatttatgca gggattaaag taaggcaatt atgtaaactt 3960 
cttaggggaa ccaaagcact aacagaagta gtaccactaa cagaagaagc agagctagaa 4020 
ctggcagaaa acagggaga.t tctaaaagaa ccggtacatg gagtgtatta tgacccatca 4080 
aaagacttaa tagcagaaat acagaagcag gggcaaggcc aatggacata tcaaatttat 4140 
caagagccat ttaaaaatct gaaaacagga aagtatgcaa gaatgaaggg tgcccacact 4200 
aatgatgtga aacaattaac agaggcagta caaaaaatag ccacagaaag catagtaata 4260 
tggggaaaga ctcctaaatt taaattaccc atacaaaagg aaacatggga agcatggtgg 4320 
acagagtatt ggcaagccac ctggattcct gagtgggagt ttgtcaatac ccctccctta 4380 
gtgaagttat ggtaccagtt agagaaagaa cccataatag gagcagaaac tttctatgta 4440 
gatggggcag ccaataggga aactaaatta ggaaaagcag gatatgtaac tgacagagga 4500 
agacaaaaag ttgtccccct aacggacaca acaaatcaga agactgagtt acaagcaatt 4560 
catctagctt tgcaggattc gggattagaa gtaaacatag tgacagactc acaatatgca 4620 
ttgggaatca ttcaagcaca accagataag agtgaatcag agttagtcag tcaaataata 4680 
gagcagttaa taaaaaagga aaaagtctac ctggcatggg taccagcaca caaaggaatt 474 0 
ggaggaaatg aacaagtaga taaattggtc agtgctggaa tcaggaaagt actattttta 4800 
gatggaatag ataaggccca agaagaacat gagaaatatc acagtaattg gagagcaatg 4860 
gctagtgatt ttaacctacc acctgtagta gcaaaagaaa tagtagccag ctgtgataaa 4920' 
tgtcagctaa aaggggaagc catgcatgga caagtagact gtagcccagg aatatggcag 4980 
ctagattgta cacatttaga aggaaaagtt atcttggtag cagttcatgt agccagtgga 5040 
tatatagaag cagaagtaat tccagcagag acagggcaag aaacagcata cttcctctta 5100. 
aaattagcag gaagatggcc agtaaaaaca gtacatacag acaatggcag caatttcacc 5160 
agtactacag ttaaggccgc ctgttggtgg gcggggatca agcaggaatt tggcattccc 5220 
tacaatcccc aaagtcaagg agtaatagaa tctatgaata aagaattaaa gaaaattata 5280 
ggacaggtaa gagatcaggc tgaacatctt aagacagcag tacaaatggc agtattcatc 5340 
cacaatttta aaagaaaagg ggggattggg gggtacagtg caggggaaag aatagtagac 5400 
ataatagcaa cagacataca aactaaagaa ttacaaaaac aaattacaaa aattcaaaat 5460 
tttcgggttt attacaggga cagcagagat ccagtttgga aaggaccagc aaagctcctc 5520 
tggaaaggtg aaggggcagt agtaatacaa gataatagtg acataaaagt agtgccaaga 55 80-. 
agaaaagcaa agatcatcag ggattatgga aaacagatgg caggtgatga ttgtgtggca 5640- 
agtagacagg atgaggatta acacatggaa ttccggagcg gccgcaggag ctttgttcct 5700 
tgggttcttg ggagcagcag gaagcactat gggcgcagcg tcaatgacgc tgacggtaca 5760 
ggccagacaa ttattgtctg gtatagtgca gcagcagaac aatttgctga gggctattga 5820 
ggcgcaacag catctgttgc aactcacagt ctggggcatc aagcagctcc aggcaagaat 5880 
cctggctgtg gaaagatacc taaaggatca acagctcctg gggatttggg gttgctctgg 5940 
aaaactcatt tgcaccactg ctgtgccttg gaatgctagt tggagtaata aatctctgga 6000 
acagatttgg aatcacacga cctggatgga gtgggacag*. gaaattaaca attacacaag 6060 
cttccgcgga attcacccca ccagtgcagg ctgcctatca gaaagtggtg gctggtgtgg 6120 
ctaatgccct ggcccacaag tatcactaag ctcgctttct tgctgtccaa tttctattaa 6180 
aggttccttt gttccctaag tccaactact aaactggggg atattatgaa gggccttgag 6240 
catctggatt ctgcctaata aaaaacattt attttcattg caatgatgta tttaaattat 6300 
ttctgaatat tttactaaaa agggaatgtg ggaggtcagt gcatttaaaa cataaagaaa 6360 
tgaagagcta gttcaaacct tgggaaaata cactatatct taaactccat gaaagaaggt 6420 
gaggctgcaa acagctaatg cacattggca acagcccctg atgcctatgc cttattcatc 6480 
cctcagaaaa ggattcaagt agaggcttga tttggaggtt aaagttttgc tatgctgtat 6540 
tttacattac ttattgtttt agctgtcctc atgaatgtct tttcactacc catttgctta 6600 
tcctgcatct ctcagccttg actccactca gttctcttgc ttagagatac cacctttccc 6660 
ctgaagtgtt ccttccatgt tttacggcga gatggtttct cctcgcctgg ccactcagcc 6720 
ttagttgtct ctgttgtctt atagaggtct acttgaagaa ggaaaaacag ggggcatggt 6780 
ttgactgtcc tgtgagccct tcttccctgc ctcccccact cacagtgacc cggaatccct 6840 
cgacatggca gtctagcact agtgcggccg cagatctgct tcctcgctca ctgactcgct 6900 
gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt 6960 
atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 7020 
caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga 7080 
gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata 7140 
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ccaggcgttt ccccctggaa gctccctcgt 
cggatacctg tccgcctttc tcccttcggg 
taggtatctc agttcggtgt aggtcgttcg 
cgttcagccc gaccgctgcg ccttatccgg 
acacgactta tcgccactgg cagcagccac 
aggcggtgct acagagttct tgaagtggtg 
atttggtatc tgcgctctgc tgaagccagt 
atccggcaaa caaaccaccg ctggtagcgg 
gcgcagaaaa aaaggatctc aagaagatcc 
gtggaacgaa aactcacgtt aagggatttt 
ctagatcctt ttaaattaaa aatgaagttt 
ttggtctgac agttaccaat gcttaatcag 
tcgttcatcc atagttgcct gactccccgt 
accatctggc cccagtgctg caatgatacc 
atcagcaata aaccagccag ccggaagggc 
cgcctccatc cagtctatta attgtt'gccg 
tagtttgcgc aacgttgttg ccattgctac 
tatggcttca ttcagctccg gttcccaacg 
gtgcaaaaaa gcggttagct ccttcggtcc 
agtgttatca ctcatggtta tggcagcact 
aagatgcttt tctgtgactg gtgagtactc 
gcgaccgagt tgctcttgcc cggcgtcaat 
tttaaaagtg ctcatcattg gaaaacgttc 
gctgttgaga tccagttcga tgtaa.cccac 
tactttcacc agcgtttctg ggtgagcaaa 
aataagggcg acacggaaat gtt'gaatact 
■ catt-tatcag ggttattgtc tcatgagcgg 
acaaataggg gttccgcgca catttccccg 
gggcccccat gggctagagg atccggcctc 
gccatgagc 



gcgctctcct gttccgaccc tgccgcttac 7200 
aagcgtggcg ctttctcata gctcacgctg 7260 
ctccaagctg ggctgtgtgc acgaaccccc 7320 
taactatcgt cttgagtcca acccggtaag 7380 
tggtaacagg attagcagag cgaggtatgt 7440 
gcctaactac ggctacacta gaagaacagt 7500 
taccttcgga aaaagagttg gtagctcttg 7560 
tggttttttt gtttgcaagc agcagattac 7620 
tttgatcttt tctacggggt ctgacgctca 7680 
ggtcatgaga ttatcaaaaa ggatcttcac 7740 
taaatcaatc taaagtatat atgagtaaac 7800 
tgaggcacct atctcagcga tctgtctatt 7860 
cgtgtagata actacgatac gggagggctt 7920 
gcgagaccca cgctcaccgg ctccagattt 7980 
cgagcgcaga agtggtcctg caactttatc 8040 
ggaagctaga gtaagtagtt cgccagttaa 8100 
aggcatcgtg gtgtcacgct cgtcgtttgg 8160 
atcaaggcga gttacatgat cccccatgtt 8220 
tccgatcgtt gtcagaagta agttggccgc 8280 
gcataattct cttactgtca tgccatccgt 8340 
aaccaagtca ttctgagaat agtgtatgcg 8400 
acgggataat accgcgccac atagcagaac 8460 
ttcggggcga aaactctcaa ggatcttacc 852 0 
tcgtgcaccc aactgatctt cagcatcttt 8580 
aacaggaagg caaaatgccg caaaaaaggg 8640 
catactcttc ctttttcaat attattgaag 8700 
atacatattt gaatgtattt agaaaaataa. 8760 
aaaagtgcca cctgacggga tcccctgagg 8820 
ggcctctgca taaataaaaa aaattagtca 8880 

8889 



<210> 50 
<211> 4180 
<212> DNA ' 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 50 

aatgtagtct tatgcaatac tcttgtagtc ttgcaacatg gtaacgatga gttagcaaca 60 
tgccttacaa ggagagaaaa agcaccgtgc atgccgattg gtggaagtaa ggtggtacga 120 
tcgtgcctta ttaggaaggc aacagacggg tctgacatgg attggacgaa ccactgaatt 180 
ccgcattgca gagatattgt atttaagtgc ctagctcgat acaataaacg ccatttgacc 240 
attcaccaca ttggtgtgca cctccaagct cgagctcgtt tagtgaaccg tcagatcgcc 300 
tggagacgcc atccacgctg ttttgacctc catagaagac accgggaccg atccagcctc 360 
ccctcgaagc tagtcgatta ggcatctcct atggcaggaa gaagcggaga cagcgacgaa 420 
gacctcctca aggcagtcag actcatcaag tttctctatc aaagcaaccc acctcccaat 480 
cccgagggga cccgacaggc ccgaaggaat agaagaagaa ggtggagaga gagacagaga 540 
cagatccatt cgattagtga acggatcctt agcacttatc tgggacgatc tgcggagcct 600 
gtgcctcttc agctaccacc gcttgagaga cttactcttg attgtaacga ggattgtgga 660 
acttctggga cgcagggggt gggaagccct caaatattgg tggaatctcc tacaatattg 720 
gagtcaggag ctaaagaata gtgctgttag cttgctcaat gccacagcta tagcagtagc 780 
tgaggggaca gatagggtta tagaagtagt acaagaagct tggcactggc cgtcgtttta 840 
caacgtcgtg atctgagcct gggagatctc tggctaacta gggaacccac tgcttaagcc 900 
tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg 960 
taactagaga tcaggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc 1020 
ctttcgccag ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc 1080 
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gcagcctgaa tggcgaatgg cgcctgatgc ggtattttct cct'tacgcat ctgtgcggta 1140 
tttcacaccg catacgtcaa agcaaccata gtacgcgccc tgtagcggcg cattaagcgc 1200 
ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc tagcgcccgc 1260 
tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc gtcaagctct 1320 
aaatcggggg ctccctttag ggttccgatt tagtgcttta cggcacctcg accccaaaaa 1380 
acttgatttg ggtgatggtt cacgtagtgg gccatcgqcc tgatagacgg tttttcgccc 1440 
tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg gaacaacact 1500 
caaccctatc tcgggctatt cttttgattt ataagggatt ttgccgattt cggcctattg 1560 
gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt 1620 
tacaatttta tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagcc 1680 
ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc 1740 
ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc 1800 
accgaaacgc gcgagacgaa agggcctcgt gatacgccta tttttatagg ttaatgtcat 1860 
gataataatg gtttcttaga cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc 1920 
tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 1980 
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 2040 
ccttattccc ttttttgcgg- cattttgcct tcctgttttt gctcacccag aaacgctggt 2100 
gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 2160 
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 2220 
ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 2280 
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 2340 
gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 2400 
taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 2460 
tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 2520 
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 2580 
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 2640 
ggaggcggat aaagttgcag gaecacttct gcgctcggcc cttccggctg gctggtttat 2700 
tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc 2760 
agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga 2820 
tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 2880 
agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag 2940 
gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc 3000 
gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 3060 
tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 3120 
gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 3180 
accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc 3240 
accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 3300; 
gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 3360 
ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 3420 
atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 3480 
gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 3540 
cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 3600 
gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 3660 
gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc 3720 
tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac 3780 
cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct 3840 
ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg actggaaagc 3900 
gggcagtgag cgcaacgcaa ttaatgtgag ttagctcact cattaggcac cccaggcttt 3960 
acactttatg cttccggctc gtatgttgtg tggaattgtg agcggataac aatttcacac 4020 
aggaaacagc tatgacatga ttacgaattc gatgtacggg ccagatatac gcgtatctga 4080 
ggggactagg gtgtgtttag gcgaaaagcg gggcttcggt tgtacgcggt taggagtccc 4140 
ctcaggatat agtagtttcg cttttgcata gggaggggga 4180 

<210> 51 

<211> 5821 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note »■ 
synthetic construct. 
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<400> 51 

ttggcccatt gcatacgttg tatccatatc 
caacattacc gccatgttga cattgattat 
ggtcattagt tcatagccca tatatggagt 
cgcctggctg accgcccaac gacccccgcc 
tagtaacgcc aatagggact ttccattgac 
cccacttggc agtacatcaa gtgtatcata 
acggtaaatg gcccgcctgg cattatgccc 
ggcagtacat ctacgtatta gtcatcgcta 
tcaatgggcg tggatagcgg tttgactcac 
tcaatgggag tttgttttgg caccaaaatc 
ccgccccatt gacgcaaatg ggcggtaggc 
ctcgtttagt gaaccgtcag atcgcctgga 
gaagacaccg ggaccgatcc agcctcccct 
cctgagaact tcagggtgag tctatgggac 
tggttaagtt catgtcatag gaaggggaga- 
gggtaatttt gcatttgtaa ttttaaaaaa 
tatcttattt ctaatacttt ccctaatctc 
tcatgcctct ttgcaccatt ctaaagaata 
caatatttct gcatataaat atttctgcat 
attgctaata gcagctacaa tccagctacc 
gctggattat tctgagtcca agctaggccc 
tcctcccaca gctcctgggc aacgtgctgg 
cacgtgagat ctgaattctg acactatgaa 
tggggtgaat tgcaagttca ccatagtttt 
tgttccttct aattaccatt attgcccgtc 
aataggcaca gccttacaag tcaaaatgcc 
ttggatgtgt catgcttcca aatgggtcac 
gtatataaca cattccatcc gatccttcac 
tgaacaaacg aaacaaggaa cttggctgaa 
tgcaactgtg acggatgccg aagcagtgat 
tgatgaatac acaggagaat gggttgattc 
catatgcccc actgtccata ac.tctacaac 
atgtgattct aacctcattt ccatggacat 
atccctggga aaggagggca cagggttcag 
caaggcctgc aaaatgcaat actgcaagca 
gttcgagatg gctgataagg atctctttgc 
gtcaagtatc tctgctccat ctcagacctc 
gaggatcttg gattattccc tctgccaaga 
aatctctcca gtggatctca gctatcttgc 
caccataatc aatggtaccc taaaatactt 
tgctccaatc ctctcaagaa tggtcggaat 
gtgggatgac tgggcaccat atgaagacgt 
cagttcagga tataagtttc ctttatacat 
tcatcttagc tcaaaggctc aggtgttcga 
acttcctgat gatgagagtt tattttttgg 
gcttgtagaa ggttggttca gtagttggaa 
agggttaatc attggactat tcttggttct 
aaagcacacc aagaaaagac agatttatac 
actcaaatcc tgcacaacag attcttcatg 
tcaaagaggc ctcaattata tttgagtttt 
gaattcaccc caccagtgca ggctgcctat 
ctggcccaca agtatcacta agctcgcttt 
ttgttcccta agtccaacta ctaaactggg 
ttctgcctaa taaaaaacat ttattttcat 
attttactaa aaagggaatg tgggaggtca 
tagttcaaac cttgggaaaa tacactatat 
aaacagctaa tgcacattgg caacagcccc 
aaggattcaa gtagaggctt gatttggagg 
acttattgtt ttagctgtcc tcatgaatgt 



ataatatgta catttatatt ggctcatgtc 60 
tgactagtta ttaatagtaa tcaattacgg 120 
tccgcgttac ataacttacg gtaaatggcc 180 
cattgacgtc aataatgacg tatgttccca 240 
gtcaatgggt ggagtattta cggtaaactg 300 
tgccaagtac gccccctatt gacgtcaatg 360 
agtacatgac cttatgggac tttcctactt 420 
ttaccatggt gatgcggttt tggcagtaca 480 
ggggatttcc aagtctccac cccattgacg 540 
aacgggactt . tccaaaatgt cgtaacaact 600 
gtgtacggtg ggaggtctat ataagcagag 660 
gacgccatcc acgctgtttt gacctccata 720 
cgaagcttac atgtggtacc gagctcggat 780 
ccttgatgtt ttctttcccc ttcttttcta 840 
agtaacaggg tacacatatt gaccaaatca 900 
tgctttcttc ttttaatata cttttttgtt 960 
tttctttcag ggcaataatg atacaatgta 1020 
acagtgataa tttctgggtt aaggcaatag 1080 
ataaattgta actgatgtaa gaggtttcat 1140 
attctgcttt tattttatgg ttgggataag 1200 
ttttgctaat catgttcata cctcttatct 1260 
tctgtgtgct ggcccatcac tttggcaaag 1320 
gtgccttttg tacttagcct ttttattcat 1380 
tccacacaac caaaaaggaa actggaaaaa 1440 
aagctcagat ttaaattggc ataatgactt 1500 
caagagtcac aaggctattc aagcagacgg 1560 
tacttgtgat ttccgctggt atggaccgaa 1620 
tccatctgta gaacaatgca aggaaagcat 1680 
tccaggcttc cctcctcaaa gttgtggata 1740 
tgtccaggtg actcctcacc atgtgctggt 1800 
acagttcatc aacggaaaat gcagcaatta 1860 
ctggcattct gactataagg tcaaagggct 1920 
caccttcttc tcagaggacg gagagctatc 1980 
aagtaactac tttgcttatg aaactggagg 2040 
ttggggagtc agactcccat caggtgtctg 2100, 
tgcagccaga ttccctgaat gcccagaagg 2160 
agtggatgta agtctaattc aggacgttga 2220 
aacctggagc aaaatcagag cgggtcttcc 2280 
tcctaaaaac ccaggaaccg gtcctgcttt 2340 
tgagaccaga tacatcagag tcgatattgc 2400 
gatcagtgga actaccacag aaagggaact 2460 
ggaaattgga cccaatggag ttctgaggac 2520 
gattggacat ggtatgttgg actccgatct 2580 
acatcctcac attcaagacg ctgcttcgca 2640 
tgatactggg ctatccaaaa atccaatcga 2700 
aagctctatt gcctcttttt tctttatcat 2760 
ccgagttggt atccatcttt gcattaaatt 2820 
agacatagag atgaaccgac ttggaaagta 2880 
tttggaccaa atcaacttgt gataccatgc 2940 
taatttttat gaaaaaaaaa aaaaaaaacg 3000 
cagaaagtgg tggctggtgt ggctaatgcc 3060 
cttgctgtcc aatttctatt aaaggttcct 3120 
ggatattatg aagggccttg agcatctgga 3180 
tgcaatgatg tatttaaatt atttctgaat 3240 
gtgcatttaa aacataaaga aatgaagagc 3300 
cttaaactcc atgaaagaag gtgaggctgc 3360 
tgatgcctat gccttattca tccctcagaa 3420 
ttaaagtttt gctatgctgt attttacatt 3480 
cttttcacta cccatttgct tatcctgcat 3540 
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ctctcagcct tgactccact cagttctctt gcttagagat accacctttc ccctgaagtg 3600 
ttccttccat gttttacggc gagatggttt ctcctcgcct ggccactcag ccttagttgt 3660 
ctctgttgtc ttatagaggt ctacttgaag aaggaaaaac agggggcatg gtttgactgt 3720 
cctgtgagcc cttcttccct gcctccccca ctcacagtga cccggaatcc ctcgacatgg 3780 
cagtctagca ctagfcgcggc cgcagatctg cttcctcgct cactgactcg ctgcgctcgg 3840 
tcgttcggct gcggcgag.cg gtatcagctc actcaaaggc ggtaatacgg ttatccacag 3900 
aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc 3960 
gtaaaaaggc cgcgttgctg gcgtt-tttcc ataggctccg cccccctgac gagcatcaca 4020 
aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actabaaaga taccaggcgt 4080 
ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc 4140 
tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc 4200 
tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc 4260 
ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact 4320 
tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg 4380 
ctacagagtt cttgaagtgg tggcctaact acggctacac tagaagaaca gtatttggta 4440 
tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca 4500 
aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa 4560 
aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg 4620 
aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc 4680 
ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg 4740 
acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat 4800 
ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg 4860 
gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa 4920 
taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca 4980 
tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc 5040 
gcaacgttgt tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt 5100 
cattcagctc cggttcccaa cgatcaaggc gagttacatg atc.ccccatg ttgtgcaaaa 5160 
aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat 5220 
cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct 5280 
tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga 5340 
gttgctcttg cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag 5400 
tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga 5460 
gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca 5520 
ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg 5580 
cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc 5640 
agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag 5700 
gggttccgcg cacatttccc cgaaaagtgc cacctgacgg gatcccctga gggggccccc 5760 
atgggctaga ggatccggcc tcggcctctg cataaataaa aaaaattagt cagccatgag 5820 
c. ' • 5821 



<210> 52 
<211> 1986 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 52 

tcctcctctg ctcgcccaat ccttccaacc ccctatggtg gtatggctga cacagaaaat 60 
gtctgctcct gtatgggaca tttgcccctc ttctccaaat ataagacagg atgaggccta 120 
gcttttgctg ctccaaagtt ttaaaagaac acattgcacg gcatttaggg actctaaagg 180 
gtggaggagg aatgagggaa ttgcatcatg ccaaggctgg tcctcatcca tcactgcttc 240 
cagggcccag agtggcttcc aggaggtatt cttacaaagg aagcccgatc tgtagctaac 300 
actcagagcc cattttcctg cgttaacccc tcccgacctc atatacagga gtaacatgat 360 
cagtgacctg ggggagctgg ccaaactgcg ggacctgccc aagctgaggg ccttggtgct 420 
gctggacaac ccctgtgccg atgagactga ctaccgccag gaggccctgg tgcagatggc 480 
acacctagag cgcctagaca aagagtacta tgaggacgag gaccgggcag aagctgagga 540 
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gatccgacag aggctgaagg aggaacagga gcaagaactc gacccggacc aagacatgga 600 
accgtacctc ccgccaactt agtggctcct ctagcctgca gggacagtaa aggtgatggc 660 
aggaaggcag cccccggagg tcaaaggctg ggcacgcggg aggagaggcc agagtcagag 720 
gctgcgggta tctcagatat gaaggaaaga tgagagaggc tcaggaagag gtaagaaaag 780 
acacaagaga ccagagaagg gagaagaatt agagagggag gcagaggacc gctgtctcta 840 
cagacatagc tggtagagac tgggaggaag ggatgaaccc tgagcgcatg aagggaagga 900 
ggtggctggt ggtatatgga ggatgtagct gggccaggga aaagatcctg cactaaaaat 960 
ctgaagctaa aaataacagg acacggggtg gagaggcgaa aggagggcag attgaggcag 1020 
agagactgag aggcctgggg atgtgggcat tccggtaggg cacacagttc acttgtcttc 1080 
tctttttcca ggaggccara gatgctgacc tcaagaactc ataatacccc agtggggacc 1140 
accgcattca tagccctgtt acaagaagtg ggagatgttc ctttttgtcc cagactggaa 1200 
atccattaca tcccgaggct caggttctgt ggtggtcatc tctgtgtggc ttgttctgtg 1260 
ggcctaccta aagtcctaag cacagctctc aagcagatcc gaggcgacta agatgctagt 132 0 
aggggttgtc tggagagaag agccgaggag gtgggctgtg atggatcagt tcagctttca 1380 
aataaaaagg cgtttttata ttctgtgtcg agttcgtgaa cccctgtggt gggcttctcc 1440 
atctgtctgg gttagtacct gccactatac tggaataagg ggacgcctgc ttccctcgag 1500 
ttggctggac aaggttatga gcatccgtgt* acttatgggg ttgccagctt ggtcctggat 1560 
cgcccgggcc cttcccccac ccgttcggtt ccccaccacc acccgcgctc gtacgtgcgt 1620 
ctccgcctgc agctcttgac tcatcggggc ccccgggtca catgcgctcg ctcggctcta 1680 
taggcgccgc cccctgccca ccccccgccc gcgctgggag ccgcagccgc cgccactcct 1740 
gctctctctg cgccgccgcc gtcaccaccg ccaccgccac cggctgagtc tgcagtcctc 1800 
gaggaactga aaaaccagaa agttaactgg taagtttagt ctttttgtct tttatttcag 1860 
gtcccggatc cggtggtggt gcaaatcaaa gaactgctcc tcagtggatg ttgcctttac 1920 
ttctaggcct gtacggaagt gttacttctg ctctaaaagc tgcggaattg tacccgcggc 1980 
caagct 1986 1 

<210> 53 

<211> 1610 ... 
<212> DNA 

<213> Artificial Sequence 

<220> v 
<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 53 

cggatgagcc tctgtgaact actaaggtgg gagggggcta tacgcagagg agaatgtcag 60 
atgctcagct cggtcccctc cgcctgacgc tcctctctgt ctcagccagg actggtttct 120 
gtaagaaaca gcaggagctg tggcagcggc gaaaggaagc ggctgaggcg cttggaaccc 180 
gaaaagtctc ggtgctcctg gctacctcgc acagcgtgcc cgcccggccg tcagtaccat 240 
ggacagcagc gctgccccca cgaacgccag caattgcact gatgccttgg cgtactcaag 300 
ttgctcccca gcacccagcc ccggttcctg ggtcaacttg tcccacttag atggcaacct 360 
gtccgaccca tgcggtccga accgcaccga cctgggcggg agagacagcc tgtgccctcc 420 
gaccggcagt ccctccatga tcacggccat cacgatcatg gccctctact ccatcgtgtg 480 
cgtggtgggg ctcttcggaa acttcctggt catgtatgtg attgtcagat acaccaagat 540 
gaagactgcc accaacatct acattttcaa ccttgctctg gcagatgcct tagccaccag 600 
taccctgccc ttccagagtg tgaattacct aatgggaaca tggccatttg gaaccatcct 660 
ttgcaagata gtgatctcca tagattacta taacatgttc accagcatat tcaccctctg 720 
caccatgagt gttgatcgat acattgcagt ctgccaccct gtcaaggcct tagatttccg 780 
tactccccga aatgccaaaa ttatcaatgt ctgcaactgg atcctctctt cagccattgg 840 
tcttcctgta atgttcatgg ctacaacaaa atacaggcaa ggttccatag attgtacact 900 
aacattctct catccaacct ggtactggga aaacctgctg aagatctgtg ttttcatctt 960 
cgccttcatt atgccagtgc tcatcattac cgtgtgctat ggactgatga tcttgcgcct 1020 
caagagtgtc cgcatgctct ctggctccaa agaaaaggac aggaatcttc gaaggatcac 1080 
caggatggtg ctggtggtgg tggctgtgtt catcgtctgc tggactccca ttcacattta 1140 
cgtcatcatt aaagccttgg ttacaatccc agaaactacg ttccagactg tttcttggca 1200 
cttctgcatt gctctaggtt acacaaacag ctgcctcaac ccagtccttt atgcatttct 1260 
ggatgaaaac ttcaaacgat gcttcagaga gttctgtatc ccaacctctt ccaacattga 1320 
gcaacaaaac tccactcgaa ttcgtcagaa cactagagac cacccctcca cggccaatac 1380 
agtggataga actaatcatc agctagaaaa tctggaagca gaaactgctc cgttgcccta 1440 
acagggtctc atgccattcc gaccttcacc aagcttagaa gccaccatgt atgtggaagc 1500 
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aggttgcttc aagaatgtgt aggaggctct aattctctag gaaagtgcct gcttttaggt 1560 
catccaacct ctttcctctc tggccactct gctctgcaca ttagaggccg 1610 

<210> 54 
<211> 1536 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note ?= 

synthetic construct • 



<400> 54 

atgaagtgcc 

gtttttccac 

ccgtcaagct 

atgcccaaga 

gtcactactt 

ttcactccat 

ctgaatccag 

gtgattgtcc 

gattcacagt 

acaacctggc 

gacatcacct 

ttcagaagta 

aagcattggg 

tttgctgcag 

acctcagtgg 

caagaaacct 

cttgctccta 

tactttgaga 

ggaatgatca 

gacgtggaaa 

tacatgattg 

ttcgaacatc 

tttggtgata 

tggaaaagct 

gttctccgag 

tatacagaca 



ttttgtactt 
acaaccaaaa 
cagatttaaa 
gtcacaaggc 
gtgatttccg 
ctgtagaaca 
gcttccctcc 
aggtgactcc 
tcatcaacgg 
attctgacta 
tcttctcaga 
actactttgc 
gagtcagact 
ccagattccc 
atgtaagtct 
ggagcaaaat 
aaaacccagg 
ccagatacat 
gtggaactac 
ttggacccaa 
gacatggtat 
ctcacattca 
ctgggctatc 
ctattgcctc 
ttggtatcca 
tagagatgaa 



agccttttta 
aggaaactgg 
ttggcataat 
tattcaagca 
ctggtatgga 
atgcaaggaa 
tcaaagttgt 
tcaccatgtg 
aaaatgcagc 
taaggtcaaa 
ggacggagag 
ttatgaaact 
cccatcaggt 
tgaatgccca 
aattcaggac 
cagagcgggt 
aaccggtcct 
cagagtcgat 
cacagaaagg 
tggagttctg 
gttggactcc 
agacgctgct 
caaaaatcca 
ttttttcttt 
tctttgcatt 
ccgacttgga 



ttcattgggg 
aaaaatgttc 
gacttaatag 
gacggttgga 
ccgaagtata 
agcattgaac 
ggatatgcaa 
ctggttgatg 
aattacatat 
gggctatgtg 
ctatcatccc 
ggaggcaagg 
gtctggttcg 
gaagggtcaa 
gttgagagga 
cttccaatct 
gctttcacca 
attgctgctc 
gaactgtggg 
aggaccagtt 
gatcttcatc 
tcgcaacttc 
atcgagcttg 
atcatagggt 
aaattaaagc 
aagtaa 



tgaattgcaa 
cttctaatta 
gcacagcctt 
tgtgtcatgc 
taacacattc 
aaacgaaaca 
ctgtgacgga 
aatacacagg 
gccccactgt 
attctaacct 
tgggaaagga 
cctgcaaaat 
agatggctga 
gtatctctgc 
tcttggatta 
ctccagtgga 
taatcaatgg 
caatcctctc 
atgactgggc 
caggatataa 
ttagctcaaa 
ctgatgatga 
tagaaggttg 
taatcattgg 
acaccaagaa 



gttcaccata 
ccattattgc 
acaagtcaaa 
ttccaaatgg 
catccgatcc 
aggaacttgg 
tgccgaagca 
agaatgggtt 
ccataactct 
catttccatg 
gggcacaggg 
gcaatactgc 
taaggatctc 
tccatctcag 
ttccctctgc 
tctcagctat 
taccctaaaa 
aagaatggtc 
accatatgaa 
gtttccttta 
ggctcaggtg 
gagtttattt 
gttcagtagt 
actattcttg 
aagacagatt 



60 

120 

180 

240 

300 . 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320. 

1380 

1440 

1500 

1536 
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